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Description 

This invention relates generally to human papilloma virus, and more particularly, relates to nucleotide sequences 
of short strands of human papilloma virus which can be amplified and/or used to determine the presence of human 
5 papilloma virus products in a test sample, and some of which also can be amplified and/or used to determine the 
specific type of human papilloma virus of types 16 and 18 present in the test sample. 

Human papilloma virus (HPV) ) is recognized as a venereally-transmitted disease of the anogenital tract which 
often is associated with the pathogenesis of cervical cancer and its precursor lesions. More than 56 types of HPV have 
been characterized. Of these, at least 21 types infect the anogenital tract. L. Gregoire et a!., J. Clin. Micro 27 (12): 

10 2660- 2665( 1989). These mucosotropic viruses are associated most frequently with benign condyloma or latent in- 
fections. However, the presence of HPV in premalignant lesions and invasive cancers, particularly of the cervix, may 
reflect the oncogenic potential of these viruses. See P. M. Howley, in Important Advances in Oncology , D. T. Devlta, 
Jr. et at., eds., J. B. Lippincott, Philadelphia, PA (1987) at pages 55-73. 

Certain HPV types, namely, HPV type 16 and type 18, and to a lesser extent HPV types 31, 33 and 35, are found 

'5 in a high proportion of invasive cervical cancers and their metastases. However, many HPV types which infect the 
anogenital tract, such as HPV types 6 and 1 1 , are found most commonly in benign condyloma and only rarely are found 
in invasive cancers. HPV detected in the anogenital tract can be classified broadly as low risk papilloma viruses ( HPV 
types 6 and 11), intermediate risk papilloma viruses ( HPV types 31 , 33 and 35) or high risk papilloma viruses ( HPV 
types 16 and 18), based on the association of the particular HPV type with malignancy. A. T. Lorincz et al., J Nat' I 

20 Cancer Inst. 79:671 (1987). Thus, the detection of the presence of HPV and the determination of the specific type of 
HPV can provide a diagnostic and prognostic tool useful for determining the clinical significance associated with certain 
HPV types. The early detection of HPV by sensitive and specific reagents and methodologies also could provide earlier 
therapeutic management and counseling. 

A need therefore exists for accurate and reliable methods to identify and type HPV in clinical specimens. However, 

25 known polyclonal antisera prepared by immunizing animals with disrupted virions are capable of detecting H PV antigens 
in only about 30-70% of cutaneous and mucosal warts. Further, the antisera are broadly cross-reactive. Available 
immunological tests have two major drawbacks. First, only well-differentiated cells apparantly are capable of viral 
antigen expression. HPV-infected tissues which show higher degrees of neoplasia, such as carcinoma in situ , rarely 
contain HPV antigen. Thus, the further the development of the malignancy, the smaller the amount of detectable virus 

30 jn the tested tissue. Secondly, these immunological tests are unable to identify specific viral types. 

It is known that papilloma viruses share amino acid sequences in the major capsid proteins. See, for example, C. 
C. Baker, in The Papovaviridae (Vol. 2), P. M. Howley and N. P. Salzman, eds., Plenum Publ. Corp., New York (1987) 
at pages 321 -385. The DNAs of this virus cross-hybridize, indicating homologous sequences. M. F. Law et al. , J. Virol. 
58:225-229 (1979). Thus, molecular hybridization techniques have been developed as a more sensitive and specific 

35 means of detecting and differentiating HPV DNA and RNA in clinical specimens. See A. T, Lorinez, Obstetrics and 
Gynecol. Clinics of N. America 14:451 ( 1987). 

Sequences specific for the DNA and RNA of human papilloma virus are known and have been published. See, for 
example, PCT application No. WO 89/69940 published October 1 9, 1 989, PCT application No. WO 86/0581 6 published 
October 9, 1986 and European Patent Application No. 0 301 968 published February 1, 1989. 

^o The molecular hybridization techniques used to detect homologous DNA sequences are sensitive and can be 

highly specific if used with probes which bind to nucleic acid sequences which are unique to a particular HPV type. 
However, the concentration of total viral DNA in a given clinical sample may be below the limit of sensitivity of the test. 
For example, the amount of viral DNA in dysplastic cervical lesions is reduced with increasing dysplasia. 

To overcome this problem of sensitivity, viral DNA sequences can be amplified by using, for example, the polymer- 

45 ase chain reaction (PCR) or the ligase chain reaction (LCR) techniques. The products thus obtained can be identified 
by using conventional hybridization techniques for identification of virus types, such as Southern blotting. See C. Oste, 
Biotechniques 6:163( 1988), K. B. Mullis, U. S. Patent No. 4,683,202, and EP-A-320 308 (BioTechnica). 

Both PCR and LCR serve to amplify the DNA present in a test sample to detectable levels. In practice, the level 
of sensitivity is about 50 to 100 copies per sample. The next most sensitive technique is dot-blot, which can detect 

50 about 10,000 molecules, while Southern blot reliably detects about 100,000 copies of DNA per sample. 

Thus, the appropriate diagnosis of HPV may require two steps. In one strategy, the presence of a clinically relevant 
type of HPV is first detected with a group-specific primer. After the presence of HPV is detected, differentiation between 
types can be performed by using a type-specific probe having low homology between the HPVs of the group. Alterna- 
tively, differentiation can be performed using a mixture of type-specific probes at the outset, provided these probes 

55 amplify DNA independently of each other, and that they can be detected independently. In the past, such tasks were 
attempted using specific antibodies. In general, nucleic acid probes and primers allow greater discrimination among 
subtypes than do antibodies. The use of DNA-based tests increases both sensitivity and specificity over prior-art an- 
tibody-based tests. 
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It therefore would be advantageous to provide oligonucleotide strands of DNA which could be amplified and used 
to detect the presence, if any, of HPV in a test sample. It also would be advantageous to provide short oligonucleotide 
strands of DNA which could be amplified and used to detect the presence, if any, of specific types of HPV in the test 
sample. The combined use of oligonucleotide strands would be advantgeous for allowing for the specific and sensitive 
5 in vitro diagnosis of the presence and specific type of HPV present in test samples. 

SUMMARY OF THE INVENTION 

Oligonucleotides of from about 10 to about 60 nucleotides are provided which can be amplified and used either to 
10 detect specific sequences of specific types of human papilloma virus, or consensus regions with high homology among 
different types. The presence of HPV is determined by contacting the test sample with sequences provided to detect 
the presence, if any, of HPV types 6, 11, 16, 18, 31, 33 and 61. This may be done with or without prior amplification, 
for example, by PCR or LCR. Either type-specific or consensus amplification is also possible. Two oligonucleotides 
are provided if the sequence is to be amplified by PCR, and four oligonucleotides provided if amplification is by LCR, 
75 in accordance with these known amplification procedures. After the presence of HPV is detected, the type of HPV 
present in the sample can be determined by using HPV type-specific probes, by subsequent rounds of PCR, or by 
LCR. Alternatively, the presence of type-specific HPV can be determined by contacting the test sample directly with 
type-specific nucleotide sequence provided by the invention for the detection of HPV types 16 and 18. ,.,so provided 
are methods for using the oligonucleotides and kits for amplifying and detecting the presence of human papilloma virus. 

20 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a photograph of a gel following electrophoresis showing the results when the primers PCR 1 and PCR5 
were used to amplify selected plasmids wherein HPV 6 is in lane 1, HPV 11 is in lane 2, HPV 16 is in lane 3, HPV 18 
25 is in lane 4, and HPV 31 is in lane 5, HPV 33 is in lane 6, HPV61 is in lane 7, and molecular weight standards are in lane 8. 

FIG. 2 is a photograph of a gel following electrophoresis showing the results when the primers PCR 1, PCR2, 
PCR3, PCR4 and PCR5 were used to amplify plasmid p65.16.8 (HPV 16). PCR1 and PCR5 are primers according to 
the invention. 

FIG. 3 is a photograph of the ethidium bromide-stained gels wherein PCR 1 4 and PCR15 are used in conjunction 
30 with IWDO to obtain amplified PCR product. 

FIG. 4 is a graph of results obtained from performing LCR on 10 7 molecules of the selected target using LCR5A, 
LCR5A', LCR5B and LCR5BV The rate of reaction of 4-methyl lumbelliferone is expressed as fluorescence counts/ 
second/second and plotted against the target HPV type. 

FIG. 5 is a graph of results obtained from performing LCR on 10 7 molecules of the selected target using LCR6A, 
•35 LCR6A', LCR6B and LCR6BV The rate of reaction of 4-methyllumbelliferone is expressed as fluorescence counts/ 
second/second and plotted against the target HPV type. 

FIG. 6 is a graph of results obtained from performing LCR on 10 7 molecules of the selected target using LCR7A, 
LCR7A', LCR7B and LCR7B'. The rate of reaction of 4-methyllumbelliferone is expressed as fluorescence counts/ 
second/second and plotted against the target HPV type. 
40 FIG. 7 is a graph of results obtained from perfomring LCR on 10 7 molecules of the selected target using LCR8A, 

LCR8A', LCR8B and LCR8B'. The rate of reaction of 4-methyllumbelliferone is expressed as fluorescence counts/ 
second/second and plotted against the target HPV type. 

DETAILED DESCRIPTION OF THE INVENTION 

45 

The appropriate diagnosis of HPV requires two sets of conditions. The first enables the detection of all pertinent 
types, and the second set allows differentiation among them. In the past, such tasks have been attempted using specific 
antibodies. In general, nucleic acid probes and primers allow greater discrimination among subtypes than do antibodies. 
Thus, the use of DNA-based tests tends to increase both sensitivity and specificity over antibody -based tests. 

50 u. S. Patents No. 4,683,195 and 4,683,202 teach a method of amplifying DNA sequences by using PCR. This 

method now is a standard procedure in many molecualr biology laboratories. Examples 1-3 which follow below utilize 
the procedures taught in these two patents and the method as described in the package insert of the commercially- 
available Gene-Amp™ kit (Document No. 55635-6/89, Perkin-Elmer/Cetus, Emeryville, CA). 

In PCR, two complementary polynucleotide strands are amplified by treating the strands with two oligonucleotide 

55 primers such that an extension product of each primer is synthesized which is complementary to each nucleic acid 
strand. The primers are selected such that the extension product of one primer forms a template for the synthesis of 
an extension product from the other primer once the extension product of the one primer is separated from the template. 
A chain reaction is maintained by a cycle of denaturing the primer extension products from their templates, treating 
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the single-stranded molecule generated with the same primers to re-anneal, and allowing the primers to form further 
extension products. The cycle is repeated for any many times as it takes to increase the target nucleic acid segments 
to a concentration where they can be detected. 

The amplified target sequence can be detected by any of several known techniques; for example, by denaturing 
5 the double-stranded products formed by PCR, and treating those products with one or more reporter probes which 
hybridize with the extension products. The reporter probe has a detectable label, and usually is added in excess. The 
unhybridized reporter probe, therefore, must be separated from the hybridized reporter probe by involving a separation 
step. In another method of detecting the extension products without reporter probe and a separation step, the extension 
products are detected by gels stained with ethidium bromide. The diagnosis can be confirmed by transferring the DNA 

10 to nitrocellulose and probing with a probe specific to the HPV type suspected of being present in the sample. 

Alternately with PCR, one may take advantage of known restriction sites within the HPV DNA to demonstrate that 
the amplified DNA contains the expected sequence by examining the cleavage pattern(s) generated with one or more 
restriction endonucleases. Verifying the authenticity of the amplified sequence may be necessary for two reasons: (1) 
to ensure that sequences complementary to the amplifying primers are not fortuitously present in cellular DNA which 

'5 does not contain HPV DNA, and (2), to identify the type of HPV present in the sample. If the sequences chosen for 
amplification are conserved among HPV types, then the finding of an amplified product does not implicate a particular 
HPV type. It also should be possible to predict the size of the amplified product based on the binding positions of the 
two primers. Thus, when that product is found, one reasonably can be assured that HPV is present. However, two 
different types of HPV may give the same or different size products. Thus, hybridization should be used to confirm the 

20 identity of the amplified sequence until confidence is built that the interpretation of the results is reliable. It should be 
pointed out that the PCR technique will identify only closely related, or type-specific sequences in the absence of highly 
homologous primers, since only a small portion of the genome is analyzed. 

Another particularly useful detection technique is described in EP-A-357 011. In this method, a different reporter 
molecule, e.g. hapten, is attached to each primer. Following amplification, but before denaturation, duplexes can be 

25 detected by "capturing" one hapten (haptenl) with a solid phase coated with anti-haptenl. The separated complex 
can be detected with a conjugate of label and anti-hapten2, and label associated with the solid phase can be measured. 

The Ligase Chain Reaction (LCR) amplifies sections of DNA by copying the section of DNA, and copying the 
copies of that section of DNA, many times over. This method is described in European Patent Application No. 0 320 
308 published June 14, 1989, which is incorporated herein by reference. In this procedure, two probes (for example, 

30 a and B) complementary to immediately adjacent regions of a target sequence are hybridized and ligated. This ligated 
probe then is denatured away from the target, after which it is hybridized with two additional probes (A' and B') of sense 
opposite to the initial probes A and B. The secondary probes are themselves then ligated. Subsequent cycles of de- 
naturation/hybridization/ligation create the formation of double-length probes of both sense (+) and antisense (-). 
In LCR, the nucleic acid of the sample is provided either as single stranded DNA or as double-stranded DNA which 

35 is denatured to separate the strands. Four probes are utilized: the first two probes (A and B) are the so-called primary 
probes, and the second two probes (A 1 and B') are the so-called secondary probes. The first probe (A) is a single strand 
capable of hybridizing to a first segment of the primary strand of the target nucleotide sequence. The second probe 
(b) is capable of hybridizing to a second segment of the primary strand of the target nucleotide sequence. The 5" end 
of the first segment of the primary strand of the target is positioned relative to the 3' end of the second segment of the 

40 primary strand of the target to enable joining of the 3' end of the first probe to the 5' end of the second probe, when 
the probes are hybridized to the primary strand of the target nucleotide sequence. The third probe (A') is capable of 
hybridizing to the first probe, and the fourth probe (B') is capable of hybridizing to the second probe (B). The hybridized 
probes are ligated to form reorganized fused probe sequences. Then, the DNA in the sample is denatured to separate 
ligated probes from sample DNA. Successive cycles wherein the ligated probes and target DNA undergo the above- 

45 described process are performed to increase the amount of detectable DNA in the sample. The amount of cycles 
performed is dependent upon the sequence used and the sensitivity required of the test. Usually, the cycle can be 
repeated from 15 to 60 times. At least one of the probes can be conjugated to a signal generating compound. 

If the four probes are conjugated to appropriate binding members, the detection of amplified product can be ac- 
complished using standard manual or automated immunoassay procedures known to those skilled in the art. These 

50 procedures include, for example, immunochromatography, ELISA, EIA and MEIA. Hybridization also can be accom- 
plished by following standard dot-, slot- or replica-blot procedures which are known to those in the art. The sequences 
can be labelled with an appropriate signal generating compound (label), which is capable of generating a measureable 
signal detectable by external means. The various signal generating compounds contemplated include chromogens, 
catalysts such as enzymes, luminescent compounds such as fluorescein and rhodamine, chemiluminescent com- 

55 pounds, radioactive elements such as 32 P, and other labels known to those of ordinary skill in the art. The selection of 
a particular label is not critical, but it will be capable of producing a a signal either by itself or in conjunction with one 
or more additional substances. A variety of different indicator reagents can be formed of label and specific binding 
member. Either the label or a specific binding member can be varied. Examples of specific binding members which 
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can be used as a member of the indicator reagent include antibodies, both monoclonal, polyclonal, and fragments 
thereof; avidin or biotin, biotin and anti-biotin, a carbohydrate or a lectin, a complementary nucleotide sequence, an 
effector or a receptor molecule, an enzyme cofactor or an enzyme; an enzyme inhibitor or an enzyme; also any antigenic 
substances, haptens, antibodies, and combinations thereof. 
s The test sample can be any biological material suspected of containing HPV. Thus, the test sample can be human 

body tissue, or a test sample which contains cells suspected of containing HPV. 

The invention will now be described by way of Examples, which are meant to describe, but not to limit, the spirit 
and scope of the invention. 

The following terms used in the examples are trademarks, tradenames or chemical abbreviations as specified: 

10 

TRIS - chemical abbreviation for [tris(hydroyxmethyl)aminomethane], used as a buffer. 

EDTA - chemical abbreviation for ethylenediaminetetraacetic acid, a chelating agent. 

FITC - chemical abbreviation for fluorescein isothiocyanate, aflourescent hapten derivative. 

NHS-ester - chemical abbreviation for N-hydroxysuccinamide ester 
T5 MES - chemical abbreviation for [2-(N-morpholino)ethanesulfonic acid], a buffer. 

TWEEN®-20 - trademark of Atlas Chemical for polyoxyethylene sorbitan monolaurate, a detergent. 

BIS-TRIS - chemical abbbreviation for [bis-(2-hydroxyethyl )-amino]tris-(hydroxymethyl)methane, a buffer. 

TRITON X- 100® - trademark of Rohm & Haas for nonaethylene glycol octylphwhol ether, a detergent. 

IMx® - trademark of Abbott Laboratories for an automated instrument for performing microparticle enzyme immu- 
20 noassay (ME I A). 

EXAMPLES 

EXAMPLE 1 

25 

PCR was performed essentially following the package insert of the commercially available Gene-Amp™ kit (doc- 
ument No. 55635-6/89, available from Perkin-Elmer/Cetus, Emeryville, CA). The following reagents were mixed in a 
0.5 mL polypropylene tube and used in performing PCR: 



30 


Reagent 


Final Concentration 




Water 


(to give final volume = 50 or 100 uX) 




Reaction Buffer 


10 mM TRISpH 8.3 






50 mM KC1 


35 




1.5 mM MgC12 






0.01% gelatin 




dNTP mixture 


200 u.M each of dATP.dCTP.dGTP, and TTP 




pCR1 


1 u.M 


40 


pCR2 


1 u,M 




plasmid 


10|iL 1 ng/100uA. 




(or control-human placental DNA (Pooled Placental DNA, catalog D-3287, Sigma Chemical Co, St. Louis MO). 




DNA polymerase, 




45 


Thermus Acquaticus 


25 or63.9 units/1 mL 



After mixing, the reaction mixture was overlayed with 100 u.L of mineral oil. The tube then was placed in an instru- 
ment capable of incubation at several temperatures, and subjected to 30 or 40 cycles of programmed temperature 
change. The precise cycle of temperature change used, and the instrument used, varied with the experiment, and is 
detailed in the descriptions of the figures in Example 3. 



EXAMPLE 2 

Following the procedure of Example 1 , the following sequences were found to amplify sections of papilloma virus 
types 6, 11, 16, 18, 31, 33, and 61 using PCR. 

PCR 1 : CAGATGTCTC TGTGGCGGCC TAGTG (ID No. I) 
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PCR5: AGGTGTCAGG AAAACC AAAT TTATT (lONo. 5) 

PCRM: GAATTAGTTA GACCATTTAA AAG (ID No. 6) 

5 

PCR1S: GGGGAAACAC CAGAATGGAT A (ID No. 7) 

IWDO: ATCAT ATGCC CACTGTACCA T (ID No. 8) 

Sequence IWDO is derived from a sequence disclosed in International application number PCTAJS86/00629 (WO 
10 86/05816). 

TABLE 1 shows the sequences and where they map to to in the various types. 



75 

TABLE I 

SEQUENCES WHICH CAN BE USED AS PROBES OR PCR PRIMERS 



)PW»E 5EQ SEOUENCE SENSE MAPS TO: MAPS TO: MAPS TO: MAPS TO: MAPS TO: MAPS TO: 

ID NO. 



25 







(type6) 


(type 11) 


(type 16) 


(type 16) 


(type 31) 


(type 33) 


PCR 1 : 


1 CAGATGTCTCTOTOOCCCCCTAGTG 


♦ 5706-5610 


5768-5792 


5634-5658 


5610-5634 


5550-5574 


5591-5615 


PCR2: 


2 CGTTTTCCATATTTTTTTGCAGATG 


♦ 5767-5791 


5749-5773 


615-5639 


5591-5615 


5531-5555 


5572-5596 


0PCR3: 


3 AAGTTGTAAOCACCGATOAATATGT 


♦ 5844-5868 


5826-5850 


695-5719 


S67 1-5695 


5611-5635 


5652-5676 


PCR 4: 


4 AATGTACCCTAAATACCCTATATTO 


- 6008-5984 


S990-5966 


865-5841 


5841-5817 


5784-5760 


5825-58C1 


PCR5: 


5 AGGTGTCAGGAAAACCAAATTTATT 


- 6044-6020 


6026-6002 


5901-5877 


5877-5853 


5620-5796 


5861-5837 


PCR 14: 


6 GAATTAOTTAOACCATTTAAAAG 


♦ 1495-1517 


1495-1517 


1524-1546 


1595-1617 


1462-1484 


1518-1540 


PCR 15: 


7 GGGGAAACACCAGAATGGATA 


♦ 1834-1854 


1834-1854 


1863-1883 


1934-1954 


1801-1821 


1857-1877 


51WD0: 


8 ATCATATGCCCACTGTACCAT 


- 1931 -1911 


1931-1911 


1960-1940 


2031-2011 


1898-1878 


1954-1934 



35 

note: PCR2, PCR3 and PCR4 are not probes or PCR primers of the invention 

EXAMPLE 3 

40 

Linearized plasmids containing full-length papilloma virus inserts in pGEM3 were used as targets. These were 
PHPV6.1 ( HPV6), pSP65.11. 5 ( HPV 11), p65.16,8 (HPV16), pHPV18H(HPV18), pG3 HPV31 ( HPV31), 
pLNK322.HPV33 ( HPV33), and pBR322.HPV61 (HPV61). The Programmable Cyclic Reactor™ (available from Eri- 
comp, San Diego) was used as the incubation instrument. Following PCR procedures as described in Example 1,10 
4*5 u>l aliquots were analyzed by electrophoresis through agarose (comprising a 5:3 ratio of NuSieve®:SeaKem® GTG, 
available from the FMC Corp., Rockland, ME) in a buffer comprising 0.089 M TRIS, 0.089 M borate, 2 mM EDTA, and 
0.5 ppt ethidium bromide. 

FIG. 1 is a photograph of an ethidium bromide-stained 1 .2% agarose gel showing results using 63.9 units/mL DNA 
polymerase, in the DNA Thermal Cycler™ (Perkin-Elmer/CETUS, Emeryville, CA). The samples were heated for 5 

so minutes at 94°C, then subjected to 40 cycles of a temperature program of: 1 minute at 94°C, 2 minutes at 40°C, and 
1 .5 minutes at 72°C. The PCR primers used in this case were PCR 1 and PCR5 of Example 2. Examination of the gel 
following electrophoresis showed bands at the expected positions, i.e. 292 bp. Lane 1 , HPV6; lane 2, HPV 11; lane 3, 
HPV16 ; lane 4, HPV 18; lane 5, HPV31; lane 6, HPV33, lane 7, HPV61; lane 8, pooled human placental DNA ( sus- 
pected of having HPV infection); lane 9, molecular weight markers-Hae III digest of OX174. 

55 FIG. 2 is a photograph of an ethidium bromide-stained 4% agarose gel showing results using 25 units/mL DNA 

polymerase, in the Programmable Cycler Reactor™ ( Ericomp, San Diego, CA). Samples in this case were subjected 
to 30 cycles of a temperature program of: 50°C for one (1) minute, 72°C for two (2) minutes and 95°C for one (1) ) 
minute. In this case, the primers PCR1 , PCR2, PCR3, PCR4 and PCR5 of Example 2 were used to amplify plasmid 
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p65,16,8(HPV 16). Examination of the gel of Figure 2 shows bands at the expected positions, i.e., PCR 1 and PCR4, 
235 bp, lane 2; PCR1 and PCR5, 267 bp, lane 4; PCR2 and PCR4, 254 bp, lane 6; PCR2 and PCR5, 286 bp, lane 8; 
PCR3and PCR4, 174 bp, lane 10; PCR3and PCR5, 206 bp, lane 12; molecular weight marker, 123, 246, 369, 492,... 
bp ladder, lane 1 . Note footnote to Table 1 . 

5 FIG. 3 is a photograph of an ethidium bromide-stained 1 .2% agarose gel showing results using the same conditions 

as FIG. 1 . In this case, PCR14 and PCR15 were used as primers in conjunction with IWDO. The expected size of the 
amplified PCR product of PCR 1 4 and IWDO is 437 bp for all of the HPV types tested. The expected size of the product 
of PCR 15 and IWDO is 98 bp. Products of these sizes appear in the gels, confirming that PCR14 and PCR15, used 
in conjunction with IWDO, will amplify HPV DNA of types 6, 11, 16, 18, 31 , 33, and 61. Lane 1, Molecular weight 

10 marker (Hae III digest of FX 174); PCR 14 + IWDO, lanes 2-9: lane 2, HPV6 ; lane 3, HPV 11 ; lane 4, HPV16 ; lane 
5, HPV18 ; lane 6, HPV31; lane 7, HPV33; lane 8, HPV61 ; lane 9, human placental DNA suspected of being infected 
with HPV; PCR 5 + IWDO, lanes 10-17: lane 10, HPV6 ; lone 11 , HPV 11; lane 12, HPV16; lane 13, HPV18 ; lone 14, 
HPV31 ; lane 15, HPV33 ; lane 16, HPV61 ; lane 17, human placental DNA suspected of being infected with HPV; lane 
18, molecular weight marker ( Hae III digest of FX174 and HinD III digest of 1 DNA) . 

75 

EXAMPLE 4 

The following reagents were mixed in a 0.5 mL polypropylene »ube as follows for the Ligase Chain Reaction (LCR): 



Reagent 


Volume 


Final Concentration 


Water 


21 u,L 




Reaction Buffer 


10uJ_ 


50 mMEPPS pH7.8 






10 mM NH 4 CI 






10 mM MgCI 2 






100 mM K + (from all sources) 






0.001% BSA 






1 mM DDT 


Nicotine Adenine Dinucleotide (NAD) 


0.5 u,L 


100 [lL 


ProbeA (sense) 


4uJ_ 


5.0 x 10 11 molecules 


ProbeA' (antisense, 5'-phosphate) 


4uX 


7.5 x 10 11 molecules 


ProbeB (sense, 5'-phosphate) 


4u.L 


7.5 x 10 11 molecules 


Probe B' (antisense) 


4uL 


5.0 x 10 11 molecules 


Target (including human placental carrier DNA at 10 u.g/mL) 


1.5fiL 


15ng/50 uX 


DNA liqase, Thermus therDODhilus 


1U.L 





This reaction mixture was overlayed with 30 uX of mineral oil. The tube was placed in an instrument capable of 
incubation at several temperatures ( e.g. thermal cycler from Coy Laboratory Products (Ann Arbor, Ml) or the Program- 
mable Cycler Reactor™ (available from Ericomp, San Diego, CA), and then subjected to several cycles of programmed 
temperature change. Each cycle involved incubation at 50°C for one minute and 85°C for one minute. 

EXAMPLE 5 

The following procedure was used when performing the Ligase Chain Reaction ( LCR), which is described in 
published European Patent Application No. 0 320 308 A2. The reagents of Example 4 were utilized in the procedure 
as follows: Two probes (A and B) complementary to immediately adjacent to regions of a target sequence were hy- 
bridized and ligated. This ligated probe was denatured away from the target, and hybridized with two additional probes 
(A* and B') of sense opposite to the initial probes (A and B). The secondary probes then were ligated. Subsequent 
cycles of denaturation/hybridization/ligation created the formation of double-length probes of both + and - sense. 

EXAMPLE 8 

The following sequences were determined to be specific for a portion of the E6 region of HPV type 16: 
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P rotas 


5EQ ID NO- 


Seouence 






MflQS tQ. 


LCR5A 


81 


GCTGCAAACA 


ACTATACATG 


ATATAA 


157 - 182 


LCR5A' 


82 


pTTATATCATG 


TATAGTTGTT 


TGCAGC 


182 - 157 


LCRSB 


83 


pTATTAGAATG 


TGTGTACTGC 


AAGCA 


183 - 208 


LCR5B' 


84 


TGCTTGCAGT 


ACACACATTC 


TAATA 


208 - 157 



70 

EXAMPLE 9 

Base-denatured plasmids which contained full-length papilloma virus inserts in pGEM3 were used as targets. 
These plasm ids were pG3HPV6(+) (HPV6), pSP 65. 11.5 (HPV11), pSP65.168 (HPV16), p63HPV18H(-)(HPV18), 

15 p63:HPV31 (HPV31), pLNK322:HPV33 (HPV33), pBR322:HPV35 (HPV35), pUCl9:HPV52 (HPV52), pLNK322: 
HPV58 (HPV58), pUC9:HPV59 (HPV59) and PBR322:HPV61 (HPV61). All of the oligonucleotides used as probes 
from Example 8 had chemical labels covalently attched at the ends distal from ligation. These labels were: 5'-fluores- 
cein-LCRSA, S'-fluorescein-LCRSA', 3'- biotin-LCR5B and 5'-biotin-LCR5B'. Covalent attachment was performed by 
known methods, i.e., reaction of amine-terminated oligonucleotides with FITC or biotin-NHS-ester essentially following 

20 the procedures of Kansal et al. Tet. Letters 29:5537-5540 (1988). The thermal cycler used was obtained from Coy 
Laboratory Products, Ann Arbor, Ml. 

Following the LCR procedure of Examples 4 and 5, the mixtures were analyzed using a prototype version of the 
IM X ® instrument (Abbott Laboratories, Abbott Park, IL), following the protocol for microparticle enzyme immunoassays 
as follows. A 40jiL aliquot of an LCR mixture was diluted 1:1 with distilled water. This diluted mixture was incubated 

25 with SO^L antifluorescein-conjugated polystyrene microparticles for five (5) minutes to form a suspension of immune 
complexes on the microparticles. This suspension then was transferred to an inert glass fiber matrix, to which the 
microparticles became attached. The matrix was washed with buffer (0.3M Nad, 10 mM TRIS pH8, 0,1%NaN 3 ), Any 
immune complexes attached to the glass matrix was detected by using alkaline phosphatase-labeled conjugate that 
catalyzed the hydrolysis of 4-methylumbelliferone. The rate at which the 4-methylumbelliferone was generated on the 

30 matrix was proportional to the concentration of LCR product formed in the reaction mixture. 

Referring to FIG. 4, the graph shows the results obtained from performing LCR on 10 7 molecules of the targets in 
shown. The rate shown is the rate of generation of 4-methylumbelliferone, and is expresssed as fluorescence counts/ 
second/second. Background signal is approximately 10 c/s/s, as shown by the amplification of human placental DNA. 
The only values above background are those for sample containing HPV16, and those values are about 60 times 

35 background signal. 

EXAMPLE 10 

The following sequences were determined to be specific for a portion of the E6 region of HPV type 18: 

40 



Probe 


§EQIpNp. 








Maos to; 


LCR6A 


85 


CTTCACTGCA 


AGACATACAA 


ATAA 


172 - 195 


LCR6A' 


86 


pTTATTTCTAT 


GTCTTGCAGT 


GAA 


195 - 173 


LCR6B 


87 


pCCTGTGTATA 


TTGCAAGACA 


GTAT 


196-219 


LCR6B' 


88 


TACTGTCTTG 


CAATATACAC 


AGG 


218 - 196 



EXAMPLE 11 

so 

Plasmids which contained full-length papilloma virus inserts in pGEM3 were used as targets. The plasmids used 
were those described in Example 9 . All of the oligonucleotides used as probes obtained from Example 1 0 had chemical 
labels covalently attached at the ends distal from ligation. The thermal cycler was obtained from Coy Laboratory Prod- 
ucts, Ann Arbor, Ml. 

55 Following LCR procedure described in Examples 4 and 5, the mixtures were analyzed as described in Example 9 

using the prototype version of the IM X ® instrument (Abbott Laboratories, Abbott Park, IL). 

Referring to FIG. 5, the graph dislays the results obtained from performing LCR on 10 7 molecules of the targets. 
The rate shown is the rate of generation of 4-methylumbelliferone, and is expressed as fluorescence counts/second/ 
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second. Background signal is approximately 15 c/s/s, as shown by the amplification of human placental DNA. The only 
values above background are those tor sample containing HPV 8, and those values are about 40 times background 
signal. 

EXAMPLE 12 

The following sequences were determined to be specific for a portion of the E6 region of HPV type 18: 



Probe 


SFQ ID NO 


Seauence 






Macs to;. 


LCR7A 


89 


TATATTGCAA 


GACAGTATTG 


GAAC 


200 - 223 


LCR7A- 


90 


pGTTCCAATAC 


TGTCTTGCAA 


TTTA 


223 - 200 


LCR7B 


91 


pTTACAGAGGT 


ATTTGAATTT 


GCATT 


224 - 249 


LCR7B' 


92 


AATGCAAATT 


CAAATACCTC 


TGTAA 


249 - 224 



EXAMPLE 13 

Plasmids which contained full-length papilloma virus inserts in pGEM3 were used as targets. The plasmids were 
those of Example 9 All of the oligonucleotides from Example 12 which were used as probes had chemical labels 
covalently attached at the ends distal from ligation. The thermal cycler was as described in Example 11. 

Following the LCR procedure of Examples 4 and 5, the mixtures were analyzed as described in Example 9 using 
the prototype version of the IMx instrument (Abbott Laboratories, Abbott Park, IL). 

Referring to FIG. 6 , the graph shows the results obtained from performing LCR on 10 7 molecules of the targets. 
The rate shown is the rate of generation of 4-methylumbelliferone, and is expressed as fluorescence counts/second/ 
second. Background signal is approximately 15 c/s/s, as shown by the amplification of human placental DNA. The only 
values above background are those for sample containing HPV 18, and those values are about 80 times background 
signal. 

EXAMPLE 14 



The following sequences were determined to be specific for a portion of the E6 region of HPV type 16. 



Probe 


SEQ ID N 0l 


Seauence 






Maostn 


LCR8A 


93 


GTATGGAACA 


ACATTAGAAC 


AGCA 


352 - 


375 


LCR8A' 


94 


pTGCTGTTCTA 


ATGTTGTTCC 


ATAC 


375 - 


352 


LCR8B 


95 


pATACAACAAA 


CCGTTGTGTG 


ATTT 


376 - 


399 


LCR8B' 


96 


AAATCACACA 


ACGGTTTGTT 


GTAT 


399 - 


376 



EXAMPLE 15 

Plasmids which contained full-length papilloma virus inserts in pGEM3 were used as targets. All of the oligonucle- 
otides from Example 14 used as probes had chemical labels covalently attached at the ends distal from ligation. The 
thermal cycler was as described in Example 11. 

Following LCR procedure of Examples 4 and 5, the mixtureswere analyzed as described in Example 9 using the 
prototype version of the IMx® instrument (Abbott Laboratories, Abbott Park, IL). 

Referring to FIG. 7 , the graph details the results obtained from performing LCR on 10 7 molecules of the targets. 
The rate shown is the rate of generation of 4-methylumbelliferone, and is expressed as fluorescence counts/second/ 
second. Background signal is approximately 10 c/s/s, as shown by the amplification of human placental DNA. The only 
values above background are those for sample containing HPV 16, and those values are about 36 times background 
signal. 
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EXAMPLE 16 

The attached Appendix (example 16) discloses the sequences of the invention aligned to known sequences. 
EXAMPLE 16 
APPENDIX 

HUMAN PAPILLOMA VIRUS 

ALIGNMENT of TYPES 6. 11 . 16. 18. 31 , and 33; with CONSENSUS SEQUENCE . 



The appendix lists the sequences of HPV types 6, 11, 16, 18, 31, and 33. It also shows where the sequences of 
this invention line up with respect to these HPV sequences. In addition, the appendix shows where other sequences, 
is known to the Inventors as of 28 September 1990, and claimed or disclosed by or unknown to others, line up with 
respect to these sequences. 

1. Sequences and Regions Claimed by Us; 

20 pcr = Sequences per examples 1 through 3 (only PCR1, PCR5 PCR14 and PCR15) 

LCR = Sequences per examples 4 through 14 only 

2. Sequences and Regions Unknown to Others and Not Claimed by Us; 

25 

PCR = Sequences designated PCR other than those above JJ 

LCR = Sequences designated LCR other than those above 

30 3. Sequences and Regions Claimed by Others; 

(Italics represents antisense sequences) 

AUS = International application number (Australians) PCT/AU88/00047 (WO 88/06634) 

35 WL = International application number (Wayne Lancaster, Wayne State University) PCT/US86/00629 

(WO 86/05816) 

BE = European Patent Application (Belgians) 89.033834 (X= T or U) 
^o C= International application number (CETUS) PCT/US89/03747 (WO 90/C2821 ) 

O = International application number (Oncor) PCT/US89/01 318 (WO 89/09940) 
and 

45 

4. Sequences and Regions Disclosed by Others. 

S = Sarkar, F.H. and Crissman. J.D. Biotechniques 9 180-184 (1990) (Italics represents antisense sequences) 

so 



55 
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w 



15 



20 



gTTAATAACAATCTTgGTTTAA AAAAtAGGAGGG ACCGAAA ACG GTTC AACCG AAAA 

Miiiniimii nun mi iiiiiii iiiiiii imiimiiiiiii 

cTTAATAACAATCTTAGTTTAA AAAAgAGGAGGG ACCGAAA ACG GTTCAACCG AAAA 

mi i i inn i mi n mi mini minimum 

gtaaACTATAATgCcaAGTTTTA AAAA AGtAGGGtGTAACCGAAA gCGGTTCAACCGAAAA 

ii him i i ii m im 1 1 1 1 1 1 1 1 1 1 inn immi 

actACAATAAT tcAtGTATA AAA CtaAGGGcGTAACCGAAA tCGGTTGAACCGAAAc 

inn iii mi ii mini i imiimi 

CtTAGTATA AAA AAgtAGGGAGTg ACCGAAA GtGgTGAACCGAAAA 

mini m n mini iiiiiii i i mm 



n 

TAATA ATAATAAT 

inn i i 



atTAATActTttaAcaattgTAGTATAtAAA AA AGGG AGTaAC CGAAAacGg t c gGg AC CG AAAA 

con — taatata-ta-aa-tcttag-T-tA-AAAaaag-AGGGagtaACCGAAA-acggtt-aACCGAAAa 

C 4 - GCCAAS TTGGCTTTT 
C 5 - GCCAGCCCTGGCTTTT 
C3 6 -CCGTTSAACCGAAAA 
C37 -CGGTCGGGACCGAAAA 
C3 8 -CGGTTSAACCGAAAM 
C39-CGGTTCAACCGAAAM 
015 - ATTAATACTTTTAACAATTGTAGTATATAAA AA AGGG AGTAACCGAAAACGGTCGGCACCGAAAA- 015 
024- ACTACAATAAT TCATGTATA AAA CTAAGGG CGTAACCGAAA TCGGTTGAACCGAAAC -02 4 

S 1 -CGGTCGGGACCGAAAA 
S3-ACCGAAAC 



25 



30 



35 



40 



58 



11 58 



33 62 



16 



31 



18 



con 



58 



60 



66 



CGGTT gTATATAAA CCAGCCCtAAAAtTTAGCAaACCAGGCATTATGGAAAGTgcAaATGCCTCCAC 

urn mum mini mi mm imimimmim i iimimi 

CGGTT aTATATAAA CCAGCCCAAAAAaTTAGCAGACGAGGCATTATGGAAAGTaAAGATGCCTCCAC 

mi it ii ii ii iii i i mi i 

CG G T gc aTATA tAAAGCA aACAT TTTgc a g t aAG g tAc t G C AC g AC tATGTTTCAAG ACaC t g AGG A 

mi i m mm mini i in i iiimii m i mi 

CGGTTaGTATA AAAGCA gACATTTTaTGcaCcAAaAgAGAACtGCaATGTTTCCacAGGA 

inn inn mm i n n i n i i n inn i i n n 

CGGTTgGTATATAAAGCAcataGTaTTtTGtgCaAAccTACAgacGCcATGTTcaAaaATCCtgCAGA 

mi iimmi i i i m ii n i mi n 

CGGT GTATATAAA agatgtGagaaacacaCcAcaaTACtatgGCgcgcTTtgAggATCCaaCAcg 



CGGTt-gtatatAAagcag— ca-a— -at— gcaaaca-agcattrcgatgttt-aagAtcC— c-ga 
GCC- C 4 AUS 1 -ATGCCTC C AC 

GCC-C5 

CGG-C 3 6 C 6 7 -AAATCCTGCAGA 

CGG-C37 C68-CCTACAGACGCCATGTTCA-C68 
CGG-C38 C71-GCAGTAAGGTACTGCAC-C7 1 

CGG-C39 010-GGATCCAACACG 
015 -CGGT GTATATAAA AGATGTGAGAAACACACCACAATACTATGGCGCGCTTTGAGGATCC -0 1 5 
02 4 -CGGTTAGTATA AAAGCA GACATTTTATGCACCAAAAGAGAACTGCAATGTTTCCACAGGA ( -02 4 ) 

CGGTG-S 1 S 2 - CCGCGCGAAACTCCTAGGTTGTGC-S 2 

CGGTTAGTATA AAAGC-S3 



45 



SO 



55 
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10 



6 125 CTCTCCAACgaCcATAGACCAGTTGTGCAAGACGTTTAATCTaTCTaTGCAtACqtTGCAAATTaAtT 

„ ,,«jLUUJ^JLUU lm ni ,,m 1 iii mi ii iii i i 

11 125 GTCTGCAAC^tCtATAGACCAGTTGTCGAAGACG 

33 129 aa aAC CAcG AAC aSTgCAt gAtTTGTGCcAACCaTTG 9Ag ACAACTA? A^CAAcAT t gA^T ACAGT 
oi in II I I II III II II I lllllllllll I II I II I I 
16 124 9 c G^CcaGAAAgTTaCcacAgTTATGCaCAGagcTGcAAACAACTATACATGAtATAAtATTAqAaT 

UK I HIM I II II | I II III I I (III III Ml | | 

31 128 aaGACCtcGgAAaTTgCaTGAaCTAaGCtCGGcAtTGgAAAtAcCctacgATGAacTAAgATTQAAtT 

I'll II I I HI H II HI Mill I Mill I II I 

gcGACCctacAAgcTaCcTGAtCTgtGCaCGGaAcTGaAcActtCactgcAaGAcaTAgaAaTaAccT 



18 131 



75 



20 



con • g-gaccaagaa— tTacat-Ag*TgtGCa-ggc-tTgaA-a-atCtatgcAt-a-aTa-aAaTaaa-T 
. GTCTGCAAC -AUS 1 AUS 7 -GCAAGACGTTTAATCT-AUS 7 

. AAGACCTC-C 6 7 C74-ACACTCTGCAAATTCAGT 
010-GCGACCCTACAAGCTACCTGATCTGTGCACGGAACTGAACACTTCACTGCAAGACATAGAAATAACCT-OlO 

6 193 GtGTGTTTTGCAaGAATGCACTGACCACaGCAGAGATtTATtCATATGCaTATAAacACCT;^AGGTc 

ii in LUlUilJli 'll'.U' 1 -!'" 1 " 11,11,11 1,1 "hiii inn iiiiiiiiii 

11 193 GCGTGTTTTGCAgGAATGCACTGACCACcGCAGAGATATATGCATATGCcTATAAgaACCTAAAGGTT 

„ 10 inn mi ii i ii i in imii i in ii i in in 

33 197 GCGTGgAaTGCAAaAAACCtTTGCaaCGAtCTGAGGTATATGAtTTTGCaTTTgcaGATTTAAcaGTT 

i< .'J!. 1 ' !J."i 'I' H " hi iiiiiiiiiiii inn in nun i 

16 192 GTGTGTACTGC AAg cAACAGTTAC tgCGAcgTGAGGTATATGAcTTTGCtTTTcggGATTTA tgcATA 

,i ,™ UJLL HH l l" 1,1111 1 11,1 "m m mm m 

31 196 GTGTcTACTGCAAaggtCAGTTAacAgaAACAGAGGTATTaGAtTTTGCArrTAcAGATTTAacaATA 

25 1S iao JL4J. ' ii mi' " i iimiiiiii ii iiiiiiiiii mini i 

18 199 GTGTaTAtTGCAAgacagtaTTggaActtACAGAGGTATTtGAaTTTGCATTTAaAGATTTAtttgTg 

con GtGTgtatTGCAagaa— catTgacac-a-caGAGgTaTatgaaTtTGCaTtTaaagAttTAa — gT- 

AUS 2 - TACGTGACTGGTGGCCGTCTC-AVS 2 C 7 3 - ACACCTAAAGGTC 

Gc -C 7 4 AUS3 -TGAGGTATATGACTTTGCTTTT- AUS 3 

30 C60-GAGGTATWTGAHTTTGC-C60 01 -CTAAAGGTT 

C6 1-GAGATWTATKCATATGC-C6 1 02 -CTAAAGGTT 
C6 9- ACAGTATTCG AACTTACAG-C 6 9 0 4 - GA TTTCCAA 

C70-CAACAGTTACTGCGACG-C70 06- TT ATGCATA 

C7 2 -GACAGTATTGG AACTTACAG-C 7 0 07 -TTATGCATA 

S 5 -GTGTTTTG C AGG AATG CACTG AC CA- S 5 0 8 - AA TACGTA T 

35 O10-GTGTATATTGCAAGACAGTATTGGAACTTACAGAGGTATTTGAATTTGCATTTAAAGATTTATTTGTG-O10" 

0 1 1- TTATTTGTG 

012- TTATTTGTC 
013 - AATAAA CAC 
017 -CTAAAGGTC 
018 -CTAAAGGTC 

40 Q2Q~GATTTCCAG 

024-GTGTGTACTGCAAGCAACAGTTACTGCGACGTGAGGTATATGACTTTGCTTTTCGGGATTTATGCATA-024 

025-TTATTTGTG 



45 
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6 261 cTGTttCGAGgCggCTaTCCaTaTGCAGCcTGcCCgTGcTGCcTACAAtTtCAtGGaAAAATaAACCA 

111 INI I II III I llllll II II II III inn I II II Mill Mill 

11 261 GTGTggCGACACaaCTtTCCcTTTGCAGCgTGTGCcTGtTGCTTAGAAcTgCAAGGgAAAATTAACCA 

M I MM III MM I III II II II M I M II I 

33 265 GTATATAGAGAgGGaAATCCATTTGgAaTATGTaAactgTGTTTgcgGTTcTtATCTAAAATTAGTGA 

MIMIIMM II lllllll II Mill I HIM III I IMMMMMMI 

16 260 GTATATAGAGAtGGgAATCCATATGC tGTATGTgAtAAATGTTTAAaGTTTTATTCTAAAATTAGTGA 

MMIMI III I HIM II III MIMIIMI 1 1 1 f 1 1 1 1 ill I HIM 

31 264 GTATATAGgGACGacAcACCACAcCgaGTgTGTacaAAATGTTTAAgaTTTTATTCaAAAgTaAGTG A 

M HIM III I III III I M llllll II MMIMI I I I II M 

18 267 GTgTATAGaGACagtAtACCcCAtGctGcaTGccatAAATGTaTAgatrrrTATTCtAgAaTtAGaGA 

con gT-TataGaGacggcaatCCatatGcag-aTGtg— aaaTGttTagaatTttattctAaAaTtAgtgA 

C-44 CTCTGYCGWWAGGTAWACGW-Ci 4 JJl-aattagnga 
C-4SCrCTGrCArATCCCCrACCA-C4 5 AUS8-GTGA 
C-46CCCTCCTGTGrCCrGTCCCT-C46 S6-GT 
C-4 7 CYCTGCYGWMGGTAWACSW-QA 7 
C-4 8 CYCTGYYGWWAGGTAWACGW-C4 8 
C-49 CYCTGYYGWDWGGTAWACSW-Ci 9 
C5 6 -MGAGACRGCWWTCCATWTG-C5 6 
C57-MGAGACRGSWWTCCATWTG-C57 
C 5 8 -MGAGACRGVWWTCCATWTG-C5 8 
C 5 9 - AG AG ACAGT ATACCGCATG -C 5 9 
GTG TG GCG AGACAACTTTC C CTTTGCAGCGTGTG C CTGTTG -O 1 
GTGTGGCGAGACAACTTTCCC-02 

03-CAACTTTCCCTTTGCAGCGTGTGCCTGTTG-03 
CACACCGCTCTGTTGAAAGGGAAACGTCGCACACGGACAAC-Oi 
G TAT ATAG AG ATGGG AATCC A-0 6 

GTATATAGAGATGGGAATCCATATGCTGTATGTGATAAATG-07 
CATA TA TCTCTACCCTTAGGTA TACGACATACACTA 2TTAC-0 8 
09 - ACCCTTAGGTArACGACArACACTATTTAC-09 
010 -GTGTATAGAGAGAGTATACCC CATGCTGCATGCC ATAAATGTATAGATTTTTATTCTAG AATTAG AGA-O 1 0 

30 

GTGTATAGAG ACAGT ATACCG-O 1 1 

GTGTATAGAGACAGTATACCCCATGCTGCATGCCATAAATG-0 1 2 
CACATATCTCTGTCATATGGGGTACGACGTACGGTATTTAC-Q 1 3 
01 4 - GrCArATCGCGTACGACGTACGGTATrrAC-Ol 4 

0 1 7 - CTGTTTCGAGGCGGCTATCCA-01 7 

018- CTGTTTCGAGGCGGCTATCCATATGCAGCCTGCGCGTGCTG-018 

• O19-GCCGArAGGrArACCrCGGACGCGCACGAC-019 
GACAAAGCTCCGCCGATAGGTATACGTCGGACGCCCACGAC- 020 
02 4 -GTATATAGAGATGGGAATCCATATGCTGTATGTGATAAATGTTTAAAGTTTTATTCTAAAATTAGTG A-02 4 * 
GTGTATAGAGACAGTATACCG-02 5 

02 6- C AGTAT ACC C CATGC TG C ATGCCATAAATG - 0 2 6 

40 
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10 



6 329 ATATAG ACACTTTgATTATGCTGgATATGCAaCaACAGTtGAAGAAGAAAC tAAacAAGAcATctTAg 

„ iiiiiiiiiiiii iiiiiiiii iiiiiii i inn iiiiiniiii it mi ii in 

11 329 ATATACACACTTTAATTATGCTCcATATGCAcCTACAGTAGAAGAAGAAACcAAtgAAGATATtTTAa 

IIIIIIIII I IIIIIII III Mill I MM Mill II I II II II 

33 333 ATATAG ACATTAT&ATTATt CTGT ATATGG AAaTACATT AG AACAAAC Ag ttAAaAAAC CTTT a a aTG 

. ,„ M 1 1 II II 1 1 1 1 MM I I IIIIIII 1 1 1 1 1 1 1 1 1 1 1 I II II Ml II II 

16 328 gTATAGACATTATtGTTATAGTtTGTATGGAACAACATTAGAACAgcaAtacAACAAACCgTTgTGTG 

„ „, Mill III I MUM MMMMIIMIMIMM I MIMI I MM 

31 332 ATTTAGAtggTATaGATATAGTGTGTATGGAACAACATTAGAAAAAtTgACaAACAAAGGtaTATGTG 

a „ B Ml III III I I lillllllll Mill MIMI I II MM II III 

18 335 ATTaAGAcatTATtCAgActcTGTGTATGGAgacACATTgGAAAAAcTaAC tAACActGGgtTATaca 



con aTatAGAcatTaTaattAt-cTgt-TATGgAacaACAtTaGAA-Aa-aaactAAcaaag-t-Tat«tg 
atatagacatt-JJl 
GTATAGACATTAT-AUS 8 
1S CSQ- ATAHSACA YATACSTTGWTGTMATCTT-C5 0 

C5 1 -ATAHSACA YATACSTTGWTGTMATC-C 5 1 
CS2-ATAHSACAYATACSTTGWTGTMAT-C5 2 
CS 3 -CTGAGACACA TACCTCTGTGTG TAACC- C 5 3 
C5 4 -CTGAGACACATACCTCTGTGTGTAA-C54 
CS 5-CTGAGACACATACCTCTGTGTG TA-C 5 5 
20 0 1 0 - AT T AAG AC ATTATTC AGACTCTGTGTATGG AGAC ACATTGG AAAAACTAACTAAC ACTGGGTTATACA- 0 1 0 

024 ~GTATAGACATTATTGTTATAGTTTGTATGGAACAACATTAGAACAGCAATACAACAAACCGTTGTGTG--02 4 
TA TA TCTGTGAAATTAA TA CGAC- S 6 

6 397 AcGTGcTAATTCGgTGcTACCTGTGTCACAAaCCGcTGTGTGAAgTAGAAAA q gTAAAaC At ATAcT 

„ , I Ml IIIIIII II II 1 1 II M I II I II III MIMIII IMIMI II 1 1 II III I 

25 11 397 AAGTGTTAATTCGtTGTTACCTGTGTCACAAgCCGTTGTGTGAAaTAGAAAAA cTAAAgCAcATAtT 

„ _ M I MIMI I Ml I Mill I M MIMI IIIIIII I I 

33 401 AAaTaTTAATTAGGTGTATTATaTCTCAAAgaCCtTTGTCTCCTcAAGAAAAAaAAcGACATgTGGAT 

ii iiiiiiiiiiiiiiii iiiiiii ii iiiiiiii iiiiiii ii inn 1 1 1 1 

16 396 ATTTGTTAATTAGGTGTATTAacTGTCAAAagCCacTGTGTCCTGAAGAAAAgCAAAGACATcTGGAc 

„ , n iiiiiiiiiiiiiiiiin i iiiiiii ii iiiiiii iiiiiiii iiiiiiiii mi 

30 31 40 0 ATTTGTTAATTAGGTGTATaAcGTGTCAAAgACCGTTGTGTCCAGAAGAAAAACAAAGACATtTGGAT 

1a , n , HM Mill Mill I Ml II I IIIIIII Mill IMIIIII Mill I II 

18 40 3 ATTTaTTAATaAGGTGccTgcgGTGcCAgAaACCGTTGaaTCCAGcAGAAAAACttAGACAccTtaAT 

con AttTgtTAATtaGgTGtat — tgTGtCAaAaaCCgtTGtgTccagaAGAAAAaca-agAcatct — at 

AUS 4 -AATTAATCCACA TAAT- AUS 4 AUS 9 -GATTTATTTG 

35 AUS 5-TGTCATAACCTTGAATGTCT-AUS 5 

O10-ATTTATTAATAAGGTGCCTGCGGTGCCAGAAACCGTTGAATCCAGCAGAAAAACTTAGACACCTTAAT-O10 
024-ATTTGTTAATTAGGTGTATTAACTGTCAAAAGCCACTGTGTCCTGAAGAAAAGCAAAGACATCTGGAC-024 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



6 464 aaccAACGCgCGgTTCATAAA 

„ ACt inn ii imim 

11 464 gggAAAGCCaCCcTTCATAAAA 

„ ... 11 mini i 

33 469 ttAAAcaAAcGATXTCATAATAT 



16 464 



31 



18 



con 



18 



con 



16 



31 



III 



AAAAAGcAAaGATTCCATAATATA 

JM linn ii iniiii ii in 

468 AAAAAGaAACGATTCCACAACATAG 
Ml Illlll llllllllll 



gCTAAATtgtacGTCGAAGGG TCGcTG 

HUM lllllilll III II 

CTAAATaaCc aGTGGAAGGG TCGTTG 

I I II Mil MM 

TtcGGGTCGtTGGGCAGGGCGcTGTgcGgCgTGTTG 

Illlll III I II II Ml I I Mm 

aGGGGTCGGTGGACcGGtCGaTGTATGtCtTGTTG 

I M lllllil ll II II II I Mill 

GaCGAaCGTGCACaGGaCGtTGCATagCaTGTTG 
I I III 



471 gAAAAacgACGATTtCACAACATAGctgggcactataGAgGccaGtgccattcgTCCtgcaaccGagc 



aaaaAa — acgatTtCAtAA-atag — 
TGGTGTATAGA-AUS9 



ctaaaggacg-tgGgcagggcg-tgcatggct-Gttg 

AOS6-AAATGTATAGATTTTTATTC-AUS6 



C65-CAACCGAGC 

0 1 0 ^AAAAACGACGATTTCACAACATAGCTGGGCACTATAGAG GC CAGTG CCATTCGTGCTGCAACC GAGC -0 1 0 
02 4 -AAAAAGCAAAGATTCCATAATATA AGCGGTCGGTGGACCGGTCGATGTATGTCTTGTTG-024 



6 512 



11 512 



33 528 



16 



31 



CcTACACTGC 

I llllllll 
CtTACACTGC 



C 
I 

C 



TGGACAACATGCATG GAAGACaTGT 

1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 IMIM III 

TGGACAACATGCATG GAAGACtTGT 
I Mill II | || I I H || 

gaggtcccgACGTAGAGAAACTGCactgtgAcgTGTAAAAacgcCATGagagGACACaagcC 

Mill MM III I Mill MM MM I 

523 cagatcatcAAGAaCACGTAGAGAAAC CCAgcTGTAA tCATGCATGGAGAtACAC 

MM I MM Mill III Mill IMM Illlll MM 

527 GagAAGACCtCGTActGAAAC CCAagTGTAA aC ATGCg TGG AG AaACAC 

I MM I I II I lllllll I 

539 acgacaGgaAcGACtcCaacgacgcAgagaaacaCaAgtataAtattAaGtaTGcAtggACctaaggC 

— ga — gagaagaccacgta-aga-Actgca ccaggtgtAaaacatgcaTGgagagAcacaaggc 

C 6 4 -GAACACGTAG AGAAAC CCAG-C 6 4 

ACGACAGGA-C65 
C 6 6 -GAGGTCCCG ACGTAG AGAA-C 6 6 

O10-ACGACAGGAACGACTCCAACGACGCAGAGAAACACAAGTATAATATTAAGTATGCATGGACCTAAGGC-O10 
02 4 -CAGATCATCAAGAACACGTAGAGAAAC CCAG-02 4 



6 547 TACCCTAAAGGA 

MM llllllll 

11 547 TACCCTAAAGGA 

II lllllll 
33 590 aACgTTAAAGGA 

II II I II 
579 TACaTTGCAtGA 

III Mill II 

577 TACgTTGCAAGAC 
III 



TATtGTAtTAGACCTGCAaCCTCCaGACCCTGTAGGGTTACATTGCTATG 

in m iiiiiiiiii urn f f 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 f 1 1 1 1 1 

TATaGTAcTAGACCTGCAgCCTCCTGACCCTGTAGGGTTACATTGCTATG 



mi 

ATATGTtTTAGA 

mi i inn 

ATATaTGTTAGA 

ill mini 

TATgTGTTAGA 

111 



I lllllll 



TTT a tATC C TG AaCC AACTG Ac C T ATACTG CTATG 

III I II II II II II I I I I II II I I I I 
TTTGCAACCaGAGaCAACTGAtCTCTACTGTTATG 

IMIMIII Ml lllllll III lllllilll 

TTTGCAAC C tC AG gCAACTGACCTCCACTGTTATG 

1Q CfVT I I MUM I III I | 

18 607 aACaTTGCAAGACattgtaTtgcatTTAGAgccccaaaAtgaaattcCggtTGACCTtCtaTGTcAcG 



con tAC-tT — AgGAc at-tgt-tTAGAcctt catcc-ga-cCa— tGaccTacacTG-tAtG 

BE16-ACCAGAGACAACXGAXCXCXACXGX-BE16 
BE 1 8-GXXAGAXXXGCAACCAGAG ACAACXGAXCXCXAC- BE 1 8 

O10-AACATTGCAAGACATTGTATTGCATTTAGAGCCCCAAAATGAAATTCCGGTTGACCTTCTATGTCACG-O10 

C89-G 
C90-G 
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10 



15 



CGTCGACgAaGTGGACggACAAGAttCACAACCT 

iiiiiii i hum nun mum 

GGTG GAC aAG GTGG AC a aACAAGAcGC AC AACCT 

IIIIIIII 
GCACAACCa 



6 €09 AGCAATTAGtAGACAGCTCAGA AG AT G A 

nmim 1 1 1 1 1 1 1 1 1 1 1 1 mm 

11 609 AGCAATTAG aAG ACAGCTCAGA AGATGA 

iiiiiiii mimim iiiii mil i n in 

33 649 AGCAATTAAgTGACAGCTCAGAtGAGCATCAaCgctTGGACcGGCCAGATGGACAA 

iiiiiiiii iiirmimi iiiii n i i ii i mi mini 

16 638 AG CAATT AAaTG ACAGCTCAG AgGAGGAGGATG OAATAG At gGTCCAGCTGG ACAA 

mum immmi immm mn immmmi 

AG C AATT Ac cC G AC AG CTCAG At GAGGAGGATG t c AT AGAc aG TCCAG C TCG ACAA 

iiiiiiii mi nil i mi iiiii 



31 636 



III III 
GCAGAACCG 

Illllllll 
GCAGAACCG 

III 

18 675 AGC AATTAa gCGAC t cag agG AaGAaaAc G ATGa aATAGA tggagttaatcatcaacatttAcCaG 

AGCAATTAa — GACagctcaGAtga-gAtGAtga-aT-GAc-gg-c-gatggaeaagacgcacAaCcg 
AGC AATTAGWAGAC -C 8 9 BE 8 -GACG AAGX CGACGGAC AAGAXXC- BE 8 

AGCAATTAARYGAC-C90 BE9-GA GGXGGACGAAGTGGACGGACAAGATTCACAACC-BE9 

BE13-XGA GGXGCACAAGGXGGACAAAC-BE13 

BE 1 4 * AGAAGAXGA GGXGGACAAGGXGGACAAACAAGACG-BE 1 4 

BE 1 5 -CAG AACCG 
BE 1 7 - AC AAGCAG AACCG 
C 6 2 -CG AAGTGG ACGG ACAAGAT -C 6 2 
C 6 3 -CAAGGTGGACAAACAAG ACG-C 6 3 
010 - AGC AATT AAGC G ACTCAG AG G AAG AAAAC G ATG AAATAGA TGGAGTTAATCATCAACATTTACCAG 



20 



25 



30 



6 671 TTAAaACAACATTtCCAAATAgTGACCTGTTG 

mi mum iiiiiii iiiiiiiiii 

11 671 TT AACACAAC ATTAC CAAAT AcTG ACCTGTTG 

mi mn i ii i iiiiiiii 

33 714 GcCACAGCtgATTACtAcATTGTAACCTGTTGT 

i ii iii iiiii i iiiiiiiiii mi 

16 703 GACAg AG C C cATT ACAATATTGTAACCTTTTGTTG 

llll I II IIIIIIIIII II IIIIIIIIIII 
31 701 GACAcAtCCaATTACAATATCGTtACCTTTTGTTGT 

II I I III II 



CTGTGGATGTGAC 

U 1 1 1 1 M I M M 

CTGTGGATGTGAC 

III II 
caCActTGTaAC 

II III II 
CAAGTGTgACT 

mn i i 

CAGTGTAAGT 
IIIII 



AGCAACGTtCGA 

mum in 

AGCAACGTcCGA 

I II II II 
ACCACaGTTCCt 

I II llll 

CTACgCTTCGg 

uu mn 

CTACACTTCGt 
I 



IB 741 ccCgacgagccgaACcAcAaCGTcACacaaTGTTGTgtatgtgtTGTAAGTgtgaagCcAgAaTTgag 

ron g-cacag— cattaCcA-At-gT-ACctgtTGttgt-ctg— g-TGT-Act accAcagTtcg- 

GACAGAGCCCAX-BE 15 BE 1 9 - AGXGXGACX CX ACGCXXCGG 

GACAGAGCCCA-.BE 17 BE20-XXGCAAGXGXGACX CXACGCXXCGG 

BE 2 4 -XXGXAAGXGXGAAGCCAGAAXXGAG 
BE25-AXGXGXXGXAAGXGXGAAGCCAGAAXXGAG 
0 1 0-CCCGACGAGCCGAACCACAACGTCACACAATGTTGTGTATGTGTTGTAAGTGTGAAGCCAGAATTGAG-0 1 0 
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75 



6 728 CTGGTTCTGc^TCtACAGAaacAGAC^ 

11 ''MIL iJLU 'I' 11 I II Mill II II Mil I III II Mil I II Mil | .|| • 

11 728 CTGCTTGTGgAGTGcACAGAcgGAGACATCAGAcAAcTACAAgAcCTTtTG CTGG GCACACTAAATAT 

„ JJJi i J 1111 1 111 1 11 1111 ' H i i i inn 

33 771 TTaTGTGTc aAc AC tACAGc aaG tGACcTaCGAACc aTACAg cAaCTacT tATGGGCACAq To AATAT 

ii ii ii i ii Ml III I II II I I I II I I II llll II I || 

16 760 TTGTGcGTACAaAGCACACAcGTAGACATTCGtACtTTGgAAGAcCTGTTAATGGGCACAcTaGGAAT 

„ , to ion inn mum inn inn i in mi nmiiimi u i nm 

31 758 TTGTGtGTACAgAGCACACAAGTAGAtATTCGcAtATTGCAAGAGCTGTTAATGGGCtCAtTtGGAAT 

i m i iii ii ii ii iiii ill ii ii i mi ii i i i i 

18 809 cTagtaGTAgAaAGCtCAgcAGacGAccTTCGagcATTcCAgcAGCTGTTtcTGaaCaCccTgtcctT 
con -Tg~ tGTacAgaGcaCAgaag-aGAcaTtcGaacatTgcAa-AgCTgtT-aTCggcaCacTaaa-aT 

XXG-BE19 BE29-AGCAAGXGACCXACGAACCAXACA-BE29 C4 2 -CCCGTGTGAYYYDTA 
XXGXGCGXAC-BE20 C 4 3 - CTTGTCCGACA GGAA 

CXAGX-BE25 

BE30-AGXACAGCAAGXGACCXACGAACCAXACAGCAACX-BE30 
010 -CTAGTAGTAGAAAGCTCAGCAGACGACCTTCGAGCATTCCAGCAGCTGTTTCTG AACACCCTGTCCTT-0 1 0 

6 796 aGTGTGTCCCATCTGCGC AC CgAAgaCcTAACAAcGATGGCGCACGATTCAGGTACAGAAAAT 

ii 7Q< JUUiiUUii^ ii III i '"in ii nmiiiii iiiiii minim 

11 796 TGTGTGTCCCATCTGCGC AC CaAAACCaTAACAAgGATGGCGGACGATTCAGGTACAGAAAAT 

mm ii i iii ii ii mi i i i urn n i mini i 

33 839 TGTGTGCCC tAcCTGTGC ACAacAAtAAACATCAtCtAcaATGGCcGATcCTGaAGGTACAaAtGgg 

1£ „ fi IIIMIIII I llll I II II I III I I | | | I I I 

16 828 TGTGTGCCCCAtCTGTTCT CAgAAACcATAATCTACcATGGCTGATCCTGCAGGTACcAATGGGGAa 

,i iiiU 111 " 11,11,1 1,1 m i m i m m m i 1 1 1 m 1 1 mini 

31 826 cGTGTGCCCCAaCTGTTCT aCtAgACtGTAA CTACAATGGCTGATCCAGCAGGTACAGATGGGGA 

i B .,7 JiULLL Hi. J. J 1 11,1 1 111 i"" Hiiiiiii nm 

IB 877 tGTGTGtCCgtggTGTgC atCccagCaGTAAgCaACAATGGCTGATCCAGaAGGTACAGAcGGGGA 

con tGTGTG-CCcatcTGtgCtaca-aaacaataatcaaCaAtg G-t g — gg ta-ag-ggat 

C40-CACACaCCCrACACJ?CC-C4 0 C75-ATGGCKGAYCCTGMAGGTAC-C75 
C4 1 * CACACAOTCACCACACG-C4 1 C76-ATGGCKGAYGATTCAGGTAC-C76 
ACACAC- C 4 2 C 7 7 - ATGGC KG AY CCTTCAGGT AC - C 7 7 

ACACAC-C 4 3 C8 1 - TACCGWCT*G<Lft CXTCCA TG- C8 1 

C 8 2 - TACCGMCTRCTAAGTCCA TG-C 8 2 
C 8 3 - TACCGMCTRGGAAGTCCA TG-C 8 3 
01 0 -TGTGTGTCCGTGGTGTGC ATCCCAGCAGTAACCAACAATGGCTGATC-O10 

QC 6 859 GAGGGGTCtGGGTGTACAGGATGGTTTATGGTAGAAGCtATAGTgcAaCACcCaACAGG TAC 

„ .„ iiii mi iiiMiiiiiniuuiiiiiiiiiiii inn i in i inn m 

11 859 GAGGGGTCGGGGTGTACAGGATGGTTTATGGTAGAAGCcATAGTAGAGCACACtACAGG TAC 

„ one I H' hiiiiiii H huh hiiiiiii i nm i nm 

33 906 GctGGGAtGGGGTGTACTGGtTGGTTTgAGGTAGAAGCaGTcaTACAGAgAAgAACAGG aGA 

OQC i ii i in mi m mm i mn u n mm ii iiiiii ii 

dfi 16 895 GaGGGtACGGGATGTAATGGaTGGTTTTATGTAGAgGCtGTAgTgGAaAaAAAAACAGG GGA 

„ aD1 in 1 1 1 1 1 1 1 1 mn mimimiii n in i n m i mn in 

31 891 GGGgACGGGATGcAATGG t TG G TTTTATG T AG AAG C a G TAaT t G AC Ag Ac Ag ACAG G GGA 

m nm 1 1 n ii mimum mi i Mm i i mn 

18 94 3 GGGcACGGGtTGtAAcGGcTGGTTTTATGTAcAAGCtaTtgTaGACAaAaAaACAGGagatgtaat 

45 con gagGGgacgGGgTGtA-tGGaTGGTTTta-GTAgAaGCt-TagTagA-aaaaaaACAGG a 

C7 8 -TGTAMWGGMTGGTTTTATGT-C7 8 " 

C79-TGTAMWGGMTGGTTTGAGGT-C7 9 

C 8 0 -TGTAMWGGMTGGTTTATGGT-C 8 0 

C 8 4 -ACA TKWCCKACCAAAATACA- C 8 4 

C 8 5-ACATXWCCJCACCAAACreCA-C 8 5 
50 C 8 6 -ACATXWCCJCACCAAATACCA-C 8 6 
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6 921 ACAAATATCAGAcGATGAGCAtCAGGACGTGGACGACAGTGGGTATGACATGGTGGACTTTATTGATG 

„ llllllllllll IMIIIII IIIIIIIIIilllllllillllllllllllllllllllllliiiM 

IX 921 ACAAATATCAGAAGATGAGGAaGAGGAGGTGGAGGACAGTGGGTATGACATCGTGGACTTTATTGATG 
_ I II llllllllllllll II I II IIIIIIM || I I II lllll | | | | 

33 968 TaATATtTCAGAMATGAGGAtGAAAcaGcaGATGACAGTGGcacgGATTTacTAGAgTTTATAGATG 

„ i Hi UUI n ii i i n inn mi mini 

16 957 TgcTATaTCAGAtCACGAGAAcCAAAAtCacAGTCAIACaGGtGAaGATTTGGTACAtTTTATACtaA 

„ «, iUJlUUiU 1 11 111 1 11111111 11 11 i" '"I H "I" i i 

31 951 C aacATTTCAGAGG ACG AAAATG AAgAC AG tAGTGATAC tGGGGAGGATATGGTtGAcTTTATTGAcA 

t , I IIIHIIIIIIII III) I II III MINIMI II IIIIIIM I 

18 1009 ate a gaTgacGAGG ACG AAAATG caACAG AcACaGGG t cGG ATATGGT aGAtTTTATTGAtA 

zon a-aaat-tcaGA-GA-GAg-AtGaa-a-g-ggatgAcA-tGGgtagGAtaTggTaGAcTTTATtGat- 
6 989 A CAGc aATATTACA CAcAATTCacTGGAAGCACAGGCATTGTTTAAcAGGCAGGAGGCG 

„ AOA I HI IMIIIII II lllll llllllllllllllllllllll llllllllllll 

11 989 A CAGgcATATTACA CAAAATTCtGTGGAAGCACAGGCATTGTTTAATAGGCAGGAGGCG 

„ i i i -1 1 1 1 ii i i ii in i iiiiiiiiiiiiii inn i i 

33 1036 ATtcTAtgGAAaATAgtATACAGGCAGAcACAGAGGCAGCcCgGGCATTGTTTAATAtACAGGAAGgG 

ii ii ii ii i Miiimi mm mi i n mini i iimm 

16 1025 ATgaTAATGAtTATttaAcACAGGCAGAAACAGAGACAGCACAtGCgTTGTTTAcTGCACAGGAAGCa 

„ 1A1A ii inn ii i mmm mmiimii n mm iiiiiimm 

31 1019 ATtgTAATGtATAcaacAAtCAGGCAGAAgCAGAGACAGCACAGGCATTGTTTCATGCACAGGAAGCg 

III llllim HIM I Ml IMIIIII MM lllll lllll I 

18 1071 cacaaggaacATtttgtgAaCAGGCAGAgctAGAGACAGCACAGGCATTGTTcCATGCgCAGGAgGtC 
:on attataatgcatatataataCAggcagA — cagaG-cAGCaCagGCaTTGTTtaat-c-CAGGA-Gcg 
6 1048 GAcaCcCATTATGCGAC TGTG CAGGACCTAAAACGAAAGTATTTAGG t AGTCCATATGTt AGTCCTAT 

II I 1 1 1 M 1 1 i ! 1 1 1 1 ! ! I II M 1 M 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 II lllllllllll 1 1 1 1 1 1 1 1 

11 104 8 GAtGCTCATTATGCGACTGTGCAGGACCTAAAACGAAAGTATTTAGGcAGTCCATATGTaAGTCCTAT 

II I I I I I lllll I I 1 I I 1 I f I I 1 t I I I I 

33 1104 GAgGATgATTtAaATGCTGTGtgtGcaCTAAAACGAAAGT TTGCCgc 

I I II I Mil II I 1 1 1 1 1 1 1 1 1 1 1 E { I I I 

16 1093 aAacAACATagAGATGCaGTaCAGGTTCTAAAACGAAAGT AT TtGGTAGTCCa 

I Mill Ml II II 1 1 ! 1 1 1 1 1 1 i 1 1 1 1 1 1 1 M II I MIIIMI 

31 1087 gAggAACATGCAGAgGCtGTGCAGGTTCTAAAACGAAAGT ATgTaGGTAGTCCt 

i i mil i i mi in imimiiii 

18 1139 cAcaAtgATGCAcAaGtgtTGCAtGTTtTAAAACGAAAGT ttgcaggaggcagcacaga 

zon gA-gatcATt-agaggctgTgcagGttcTAAAACGAAAGTatttagg-agtcca— tgtga-tgcc-t 

BE 1 -XAAAACGAAAGX-BE 1 
BE 2 - AGG ACCXAAAACGAAAG XAX X XAG - B E 2 
BE3-AGGXXCXAAAACGAAAGXAXXXGG-BE3 
BE4-AXGXXXXAAAACGAAAGXXXGCAG-BE4 
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6 1X16 AAaCAcTaTAGCcgAgCCACTgGAAAGTGAaATAAGTCCACGaTTgGACGCCATTAAACTTACAAgAC 

ii ii i mi i i M 1 1 iiiinii milium n immmimimi n 

11 1116 AAGCAaTgTAGCTaATGCAGTaGAAAGTGAgATAAGTCCACGgTTaGACGCCATTAAACTTACAAcAC 

M I I III I II II III II II I 

33 1151 ATGttcacaAagSQcTGCgGagGAcgtTGTtGAtcGTgCTgcaAacCCgtGtAgAAcgtCTATtAaTA 

I II I III II I I I II I II Mil II 

16 114 6 cT t AGTGATATTAG • TGG aTGTGTaGAcaAT AATATTAGTCC t aGaTTAAAAGCTATATGTA 

i 1 1 m 1 1 iii i i i mil ii 1 1 1 1 m m m i ii i mini mi i 

31 1141 tTAAGTGATATTAG TaGtTGTGTGGATtATAATATTAGTCCACGGTTAAAAGCTATATGcA 

i urn mm i i mmiimi in mi 

18 1198 aaAcagtccATTAGgggagcggctggagGTCGATacagAgtTaAGTCCACGGTTAcAAGaaATATctt 
con a-aca-tatAttagaggcagtggaa-gtGtggatagt — t-taagtcc — g — taaaagctAta-gta 



6 1184 AGCCAAAAAAGGTAAAGCGACGGCTGTTTcAAACcaGGGAAcTAACGGACAGTGGATATGGCTATTCT 

iimmmmmmuiimii mi urn uimmmmummni 

11 1184 AGCCAAAAAAGGTAAAGCGACGGCTGTTTgAAACAcGGGAAtTAACGGACAGTGGATATGGCTATTCT 

i m i i i m i n ii urn iiiinii] nil 

33 1219 AAaAtAAAgAAtGcAcAtacaGAAAAcGaAAAATAgaTGAgcTAGAAGACAGCGGATATGGCAATACT 

I I m ii i I i mi i i i ii in milium uuiuimi 

16 1207 T AG AAAAAc AAAG T Ag Ag C t GCAAAAaG g AG A t TATTTG AAa g cG AAG ACAG C GG GT ATG G C AATAC T 

iiiiiii i mi i i mm i in i mm mimiuiiiuimm 

31 1202 TAGAAAATaAcAGTAAAACaGCAAAAcGaAGACTcTTTGAAcTtcCAGACAGCGGGTATGGCAATACT 

ii m i im mm i i ii in ii mi n u mm i u 

18 1266 TAaAtAgTgggcag AAAAagGCAAAAaGgcGgCTgTTTac AaTatC AGAtAG tGGcTATGGCt gT tCT 

:on -a-aaAaaaag-g-Aaa — ag-aaaa~g-a-aatatttgaacta-caGAcAG-GGaTATGGC-aT-CT 

JJ3-tatggctantct 
C8 7 -ATACCGTTA WGA 
C 8 8 -A TACCGAYA WGA 



6 1252 G AAGTGG AAGC TG g aa c gg g AACG CAGGTAGAGAAACA TGGCG 

nuimimi mi umuiimu urn 

11 12S2 GAAGTGGAAGCTG CAACG CAGGTAGAGAAACA TGGCG 

Illllllll II I llll llll I I 

33 1287 G AAGTGG AAACT CAGCAGAT GGTA CAACA GGTAG 

iimmiui mum i m inn 

16 1275 GAAGTGGAAACT CAGCAGAT G tTACA GGTAG 

imimui iiiiuu i mi urn 

31 1270 GAAGTGGAAAC g CAGCAGAT G gTACA GGTAG 

miiiiii i * urn mi mi 

18 1334 GAAGTGGAAgC aacaCAGATtcaggtaacTACAaatggcgaacatggcggcaatgtatGTAG 

:on GAAGTGGAA-Ctg gca-caGataggtagag — ACA GtaG 

gaagtggaagctgnnnnncnacagat-JJ3 
C2TCACCT-C87 

crrcAccr-css 
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6 1295 taCCGGAAAATGG CCGAGATGGTCAGGAAAaGGA 

minimi iimmiiimiii in 

U 1289 A CCCGGAAAATGG CGGAGATGGTCAGGAAAgGGA 

I lllllll II I II II 

33 1321 A — AagtcAAAATGG cgACAC AaACttaaAtGActtaGA 

I I I llll I III 

16 1306 A A gggcCccatgagactgAAACACcAtgtagtcAgtAtagtGg 

I II III 

31 1301 A GGAG CAAC AAACA AC 

llll llll llll II 

18 1396 tggcggcagtacGGAGgctatagaCAACgggggcacagagggcAACA AC 

con a — — ggag aacgcaaaatgg-- — aga — gaaacacgagatggtcaggaaaggga 

6 1329 CACAGGaAGGGACATAGAGGG GGaGGAACATAcAGAGGCGGAAGCgcccacaaACAGtgtaC 

HIM! IIIIIIIIIIIIII II IIIIIIII IIIIIIIIIIII llll I 

11 1323 CACAGGgAGGGACATAGAGGGTgagGgGGTGGAACATAgAGAGGCGGAAGCagtagacGACAGcaccC 

I I III II I I I II I I II III. 

33 1358 atCtAGTGGCGtgGGgGAtGaTtcaGAaGTaAGctGTgagacaaatGtAGaTagctGTGAAA 

lllllll II I I II I III IIIIII 

16 134 9 tggAAGTGCCGgtGGttgcagTcagtAcaGTAGTGGaagtggGGGAGagGgTgtTAGTGAAAGAcAcA 

Mil I IIIIII llll lllllll II I I 

31 1317 AttAAGT tgtaATgGTAGTG ACGGGA cAcATAGTGAAcGAgAgA 

I I llll II II llll 

18 1445 A gcagtgtagacggTacaAGTG AC aAtAgcaatAtaGAaAat 

con a-caagtagggacagaga-ggt-agga-g— -agtgataga-cgg— gaagca agtgAaaga-a — 

6 1391 GgGAGCATGCAGgCACAgCAGGAATAT TgGAATTgtTAAAATGTAAaGATtTAC GggCagCATT 

I IIIIIIIIII llll lllllllll I lllll IIIIIIIIII III III I I Mil 
11 1391 GaG AG CATG CAG AC ACAt C AGG AATAT TAG AATT acTAAAATGTAAgGATATAC GAtCtACATT 

II II MM II III I I I I I I I I I III I 

33 1420 atgttACgttgCAGGAA AT TAG t AATG TTC T Ac At Ag T AGT AAT AC AAAAG C AA a tAT 

II II I lllll Ml I I lllllll MM Ml II 
16 1417 cTatAtgcCaAACACcacttacAA ATATTTTaAATGTaCTAAAAACTAGTAATGCAAAgGCAGCaAT 

I I I lllll I llll II I II IIIIIIIIII I M I II II II II 

31 1361 aTgAAaCtCCAACAC GtA ATATATTgcAaGTGTTAAAAACTAGCAATGgtAAAGCtGCTAT 

I II I III I I I II I II II M I I I MM Ml lllll 

18 1487 gTaAAtCcaCAAtgtaccataGcAcAatTAaaagActTGTTAAAAgtaAaCAATaaacAAGgaGCTAT 

con gtgaa — t-caa-c-ca-caggaAtAtattagaaatgtt-tAaaaa — ag-aaTacaaaagcagc-aT 

6 1455 ACtTGGTAAGTTTAAAGAaTGCTTTGGGCTGTCtTTTaTaGATTTAATTAGGCCATTTAAAAGTGATA 

II MMMIIIIMIII MIIIMMIIMI Mi I II 1 1 II II 1 1 II I II II I II II II II II 

11 1455 ACaTGGTAAGTTTAAAGAcTGCTTTGGGCTGTCaTTTgTtGATTTAATTAGGCCATTTAAAAGTGATA 

I III IIIIIIII II Ml I Ml I II Ml I II MIMMIMIIMM 

33 1478 ATTAtaTAAATTTAAAGAGgcCTATGGaaTaAGTTTTaTgGAATTAGTAAGACCATTTAAAAGTGATA 

III IMIIMMMI II II I IIIIII II II M II I II 1 1 II I II 1 1 II M III 

16 14 84 GTTAGcaAAATTTAAAGAGTTATAcGGgGTgAGTTTTtcaGAATTAGTAAGACCATTTAAAAGTAATA 

lllll MIIIIIMM Mill II II MMM III II I II IIIIII I II I llll 
31 14 22 GTTAGgtAAATTTAAAGAaTTATATGGtGTAAGTTTTAtgGAAcTAaTTAGgCCATTTcAAAGcAATA 

HIM IMIIMM MMM II MM II II llll Ml llll III 

18 1555 GTTAGcagtATTTAAAGAcacATATGGgcTAtcaTTTAcaGAttTAgTTAGaaatTTTaAAAGtgATA 

con -ttaggtaaaTTTAAAGA-t— TatGGgcT tTTTataGA-tTA-TtAG-ccaTTTaAAAGtgATA 

Jj4-ttagttagaccatttaaaagtgata 
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6 1523 aAACaACATGTttaGATTGGGTGGTaGCAGGgTTTGGTATACATCATAGCATAtCAGAgGCATTTCAA 

III IIIIII lllllllllll Mill IIIIIIIIIIIIMIIIIIII I I I I IHIIIIII 
11 1523 gAACcACATGTgCcGATTGGGTGGTtGCAGGATTTCCTATACATCATAGCATAGCAGAtGCATTTCAA 

III I III I IIIIII I IIIIII III II II Mill 1 1 II II 

33 1546 AAACAAgcTGTaCiGATTCGTGTATaaCAGGATaTGGAATTAgTCCatcagTAGCAGAaAGTTTAAAA 

II III III lllllllllll I I II IMI III II Nil II Ml Mill 

16 1552 AAtCAACgTGTtgcGATTGGTGTATtGCTGCATTTGGAcTTACACCcAgtaTAGCtGAcAGTaTAAAA 

ii ii mi iiiiinii i him Mini ii ii i i i n ii i i mi 

31 1490 AAAgCACaTGTACtGATTGGTGTgTAGCTGCgTTTGGAGTTACAggtACAgTtGCAGAAGGATTTAAA 

Ml Ml Mill IIIIII I 1 1 1 1 IIMIIM I III I lllllllllllllll 

18 1623 AAAcCACgTGTACaGATTGGgtTacAGCTataTTTGGAGTaAacccaACAaTaGCAGAAGGATTTAAA 

con aAac-AcaTGTacaGATTGG — t-tagC-ggaTtTGGaaT-aatccta-aaTagCaGAaggatTtaAA 

aaacaacNtgtNcagattgg-JJ4 

6 1591 AAaTTAATTGAGCCATTAAGTTTATATGCACATATACAATGGCTaACAAATGCATGGGGAATGGTAtT 

II 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 It 1 1 1 ! 1 1 f IIIIIIIIIIIIIIIIIIMI I 

11 1591 AAgTTAATTGAGCCATTAAGTTTATATGCACATATACAATGGCTtACAAATGCATGGGGAATGGTAcT 

IIIIII I I Mill III I III 1 1 1 1 1 1 1 III I - I Mill II I 

33 1614 gtATTAATTaAACAgcATAGTTTgTATaCtCATtTACAATGTTTAACtTGcGataGaGGAATaaTAaT 

I II I MM II 1 1 II III II I III Mill I II I HIM I I 

16 1620 ACACTATTaCAACAATATTGTTTaTATTtaCAcaTtCAAAGTTTAGCATGTTCaTGGGGAATGGTTgT 

II HIM MM IHIIIIII IMI II I IIIIIIIIHIIHIM Mill IIIIII I 

31 1558 ACcCTATTgCAACCATATTGTTTgTATTgCCATtTaCAAAGTTTAGCATGTTCcTGGGGcATGGTTAT 

II III I II MM I II III II 1 1 I III II III III Mill I I II 

16 1691 ACaCTAaTaCAgCCATtTataTTaTATgcCCATaTtCAAtGTcTAGacTGTaaaTGGGGagTatTaAT 
con aca-TAaTtcA-Ccat-tagtTTaTATgcaCAtaTaCAAtGt-Ta-catgtgcatGgGGaaTggTaaT 
6 1659 gTTAGTATTAtTAAGaTTTAAAGTAAATAAaAGtAGAaGTACCGTtGCACGTACAcTtGcaACGcTAT 

1 1 1 1 1 II 1 1 IMI IIIIIIIIIIIMI II III 1 1 1 1 1 1 1 lllllllll I I III III 

11 1659 ATTAGTATTAATAAGgTTTAAAGTAAATAAgAGCAGATGTACCGTgGCACGTACAtTaGGTACGTTAT 

1 1 1 1 I Mill II llll I II I III I II II Ml I I Ml MM 

33 1682 ATTAtTgTTAATtAGATTTAggTGTAGcAAAAACAGgTtAACAGTaGCAAAAcTAaTGaGTAAtTTAT 

Ml I III I MM II III I I II II II llll I I II II I II III Ml 

16 1688 GTTAcTaTTAGTaAGATaTAAATGTGGAAAAAATAGAgaAACAATTGAAAAATTgcTGtcTAAAcTAT 

Mil I I II llll IIMIIM II I Mill II III I MM Ml II II I III III 

31 1626 GTTAaTgCTtGTGAGATtTAAATGTGcAAAAAATAGAaTAACAATTGAAAAATTaTTAgaaAAATTAT 

111 ii ii i i 1 1 1 1 1 1 1 ii i mi him m m m i ii i 

18 1759 aTTAgccCTgtTGcGtTacAAATGTGgtAAgAgTAGAcTAACAgTTGctAAAggtTTAagtAcgTTgT 
con -TTAgtatTa-TaaGaTttAaatgt-gtAAaA-tAGa-taACagTtGcaaaa-tatTaggtA-gtTaT 

6 1727 TAAATATACCTGAAAAcCAaATGTTAATaGAGCCaCCaAAAATACAAAGTGGtGTtgcAGCCCTGTAT 

immmiimi n mimi inn it mmmiim n imiimi 

11 1727 TAAATATACCTGAAAAtCAcATGTTAATtCAGCCtCCtAAAATACAAAGTGGcGTAcGAGCCCTGTAT 

M IMIIIMM Ml I II Mill II III Ml II I Ml Mill 

33 1750 TATcaATACCTGAAAcaTGTATGgTtATAGAGCCaCCAAAATTACGgAGccaAaCAtGtGCATTGTAT 

Ml I II II MMM I I II M Ml IMIIMI II II I II Mill III 

16 1756 TATGTgTgTCTcCAAtgTGTATGaTgATAGAGCCtCCAAAATTgCGTAG t ACAGCAGCAGCATTATAT 

I Ml I III III IIIIII I II MM II "Mill HIM Mill IIIHIIIIIII 

31 1694 TgTGTaTATCTaCAAaTTGTATGTTAATTCAGCCACCcAAATTaCGTAGcACAGCTGCAGCATTATAT 

I II II || IIIIIMIIIIIIII Mill Mill II II I I M M II I Mill 

18 1827 TacacgTAcCTgaAAcTTGTATCTTAATTCAaCCACCaAAATTgCGaAGtAgtGtTGCAGCAcTATAT 

con TatataTacCTgaAAattgtATGtTaAT-gAgCCaCCaAAAtTaCgaAGtagag — gcaGCa-T-TAT 
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6 1795 TCGTTTcCtACAGGtATaTCAAATGCcACTACAGTTATAGCGCAaGCaCCaGAATGGATAACaCGCCA 

nun i inn ii 1 1 1 1 1 1 1 1 miiiiiiiimm n n minimi inn 

11 1795 TGGTrTAGgACAGGcATtTCAAATGCaAGTACAGTTATAGGGGAgGCgCCgGAATGGATAACgCGCCA 

1 1 1 ii 1 1 1 mi n urn m n in i ii mmiii i i 

33 1818 TGGTTTAGAAC AGc aATgTC AAAc ATTAGTGAtGTacAAGG t ac aACaCC tGAATGGATAg AtAG ACt 

mi ii inn ii urn mum n i n n n mmiii i mi 

16 1824 TGGTATAaAACAGGtATaTCAAAtATTAGTGAaGTgTATGGaGAcACgCCAGAATGGATAcAAAGACA 

inn i linn ii mil inn n n inn n n iiimimn mini 

31 1762 TGGTAcAGAACAGGAATgTCAAAcATTAGcGAtGTATATGGtGAaACACCAGAATGGATAgAAAGACA 

inn mmiim inn inn n in n n inn n mm mm 

18 1895 TGGTAtAGAACAGGAATaTCAAAtATTAGtGAaGTAatgGGaCAcACACCtGAgTGGATACAAAGACt 

con TGGT-tagaACAGgaATaTCAAAtattAGtgaaGTaa-aGG-gaaaCaCCaCAaTGGATA-aaaGaCa 
BE32-AXAXCAAAXAXXAGXGAAGX-BE32 JJ6-tggataNaaagaca 

6 1863 aACaGTTATTGAACAcgGgTTGCCaGACAGTCAgTTTAAATTAACaGAAATGGTGCAGTGGGCgTATG 

ii iiiiimiii i inn 1 1 i 1 1 1 1 1 minimi miiiimmnn mi 

11 1863 gACcGTTATTGAACATAGtTTGGCTGACAGTCAATTTAAATTAACTGAAATGGTGCAGTGGGCATATG 

ii iii i mini n in i i mi i mi mmiii miiiimm 

33 1886 AACtGTTTTACAACATAGcTTTAATGATAaTAtATTTGAtTTAAgTGAAATGGTACAGTGGGCATATG 

iii ii minimi mmiii n mm m i iiiimi inn n i 

16 1892 AAC AGT ATTACAACATAG TTTTAATG AT t g TAC ATTTGAaTTATC aC AgATGGTACAATGGGCcT AcG 

llllllllllll llllllllllllll llllllll II II II 1 1 1 1 1 1 i I i I ! 1 1 1 II I 

31 1830 AACAGTATTACAgCATAGTTTTAATGAcAcaACATTTGATTTGTCcCAAATGGTACAATGGGCATATG 

ii i mi m i i mi i i minimi mmmiiimm n 

18 1963 tACtaTtaTACAaCATgGaaTagATGAtAgcAatTTTGATTTGTCagAAATGGTACAATGGGCATtTG 
con aACagTt-TacAaCAtaGttTt-atGA-agtaaaTTTgA-TTa-cagAaATGGTaCA-TGGGCaTatG 
aacNgttatacaacatagtttNgatgat-JJ6 

6 1931 ATAATGAcATaTGcGAgGAgAGTGAaATtGCATTTGAATATGCACAaaGgGGAGAtTTTGAtTCtAAT 

imm n ii ii ii nm n iimmmimn i nm inn n in 

11 1931 ATAATGAtATtTGtGAaGAaAGTGAgATaGCATTTGAATATGCACAgCGTGGAGAcTTTGAcTCcAAT 

mi n i n ii nm n mi i i iiiimi i n m i i m 

33 1954 ATAAcGAgtTAacgGACGATAGTGAcATTGCATATtAtTATGCACAACTTGCAGAtTCaAATAGTAAT 

mi n ii 1 1 1 1 1 1 ii 1 1 1 imimi i mmiii i nm i mmiii 

16 1960 AT AATG Ac aTAgTaG ACGATAGTGAAATTGCATATAAATATGC ACAATT gGCAG AC ACTAATAGTAAT 

i nm i i ii imiiiiiiim 1 1 1 1 1 1 1 it i m i i m i n mi i imim 

31 1898 AcAATGAtgTtATgGATGATAGTGAAATTGCcTATAAATATGCACAATTAGCtGACAGTgATAGTAAT 

i nm i i nm ii ii ii ii i i mini mm nm i n m 

18 2031 AtAATGAgcTgAcaGATGAaAGcGAtATgGCaTtTgAATATGCcttATTAGCaGACAGcaAcAGcAAT 
con AtAAtGA-aTaa — GA-GAtAGtGAaATtGCaT-TgAaTATGCacaatt-GcaGAc — t-AtagtAAT 



6 1999 GCAcGaGCaTTTTTAAATAGcAATATGCAGGCaAAATATGTgAAAGATTGTGCAAcTATGTGtAGACA 

m i ii f 1 1 1 1 1 1 1 1 1 1 milium iiiimi mninmii mill nm 

11 1999 GCAaGgGCcTTTTTAAATAGTAATATGCAGGCtAAATATGTAAAAGATTGTGCAATTATGTGcAGACA 

ii ii mum nm n n m nm n mi in nm inn 

33 2022 GCtgcTGCaTTTTTAAAAAGTAAcTCACAaGCAAAAATaGTAAAGGAcTGTGgAATAATGTGTAGACA 

ii iii m imiimi nm iimm imim mi n immiim 

16 2028 GCAaGTGCcTTTcTAAAAAGTAATTCACAGGCAAAAATtGTAAAGGATTGTGcAACAATGTGTAGACA 

III I I I I III III I II I I III I I I I I III III I M I II I III I I I III I II III I I III I 
31 1966 GCAtGTGCaTTTTTAAAAAGTAATTCgCAGGCAAAAATaGTtAAAGATTGTGgAACAATGTGTAGACA 

iii iii milium mt n ii in i iiimmi mmiim i m 

18 2099 GCAgcTGCcTTTTTAAAAAGcAATTgcCAaGCtAAAtattTaAAAGATTGTGccACAATGTGcAaACA 
con GCa-gtGC-TTTtTAAAaAGtAAttcgCAgGCaAAA— tgTaAAaGAtTGTGcaAcaATGTGtAgACA 
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« 2067 yy*y*^^*^^*^***JGAggAAGATCTCTATaAAACAATGCATaAAacATAGGGGTtCTAAAaTaG 

,„„ IIMIIIIMIIIIIIIIII MIIMIIIII J 1 1 1 1 1 1 1 1 1 1 || III Mill HIM I I 

11 2067 TTATAAACATCCAGAAATGAaaA^ 

1111111 1 I HI II UNI II lllllllll I III II III 

33 2090 "ATAAAaAAGCACAA^ 

16 2096 BfK^^ 

3! 203, ^ MS 

JJiii 11,11 1 1 1 1 1 1 i 1 1 1 1 1 1 1 iii iiiiiiiii i iimn in i 

TTATAggCGAGCccAAAAACGACAAATGaatATGtcACAGTGGATacgAttTAGATGTtcaAAAaTag 



18 2167 
con 



. WATAaac-aGCagAAAaa-ga-AaATGtctATgagaCAaTGGATaaaataTAGatGTg-tAaa-tag 

JJll-tggataaaatatagatgtNctaaaatag 

6 2135 AagGcacAGGa^tTCGAAaCCAATTGT^ 

u iiiiiiiiii nun i in i in 1 1 1 1 1 nun i it mi inn 

16 2164 ATGATGGAGGTGATTGGAaGCAAATtGTtAtGTTTTTAAGgTATCA^ 

m i nun h i i ii ii i iiiiiiiii nun mi ni imini 

31 2102 9TGAcGaAGGTGAcTGGAGGgAcATAGTaAAGTTTCTAAGATATCAACAAAT 

i 11 H urn inn i ii i mi minimi m i iiiii 

18 2235 aTGAaGggGGaGAtTGGAGaccaATAGTgcAaTTccTgcGATAcCAACAAATAGAgCTTaTaaCAITT 
con a *9atggaGG~AtTGGA— ccaAT-GTacagTTt-TaaGatAtCAAaa-aTaGAaTTtat — CaTTT 

atgatggaggaaattgga-JJll JJ12-cattt 

6 2203 TTAActAAAtTtAAATTATGGCTGCACGGtACGCCaAAAAAAAAcTGcATAGCCATaGTAGGcCCtCC 

. . „ n , mi in i immmmim iiiii imim hi mil 1 1 1 1 

11 2203 TTAAGcAAAcTaAAATTATGGCTCCACGGaACGCCcAAAAA^ 

33 2226 TTAgGTGCATTtAAAAagTTTTTaaAAGGtATACCaAAAAAAAgcTGTATgcTAA 

lfi „„ in mm mi n ii mi u M 9 M , un im 

16 2232 TTAaCTGCATTAAAAAgaTTTTTgcAAGGcATACCtAAAAAAAAtTGcATaTTAcTATATGGTGCAGC 

„ hi 1 1 1 m 1 1 1 1 1 mil mi i ii ii iiiii ii n m n mum i 

31 2170 TTAtCTCCATTAAAgctgTTTTTAAAAGCAgTgCCaAAgAAAAAcTGTATtTTAaTAcATGGTGCAcC 

1B „„, hi ii inn i iii inn iii ii ii mn m i ui m in 

18 2303 TTAggaGCcTTAAAatcaTTTTTAAAACGAaccCCcAAaAAAAAtTGTtTagTAtTttgTGGacCAgC 



con 



TTAa-tgcatTaAAattaTtttT— AaCCaa-gCCaAAaAAAAa-TGtaTagtaaT-t-tGG-cCa-C 
ttaagtgcattaaaattatttttgcaaggNacNccNaaaaaaaa-JJ12 

6 2271 aGAtACTGGGAAaTCGTaCTTTTGtATGAG TTTAATaAgcTTTcTaGGaGGtACAGTTATTAG^eAT 

„ „„ H niiiiii mi mm mn mm i m i u n imiiiiim n 

11 2271 tGAcACTGGGAAGTCGTgCTTTTGcATGAG TTTAATtAAGTTTTTgGGgGGaACAGTTATTAGTTAT 

„ „ n , mi ii inn i iii i mn mm mini ii mn i i 

33 2294 aAAtACACGaAAGTCATatTTTGGaATGAC TTTAATacACTTTTTaaAACGGTgTCTTATATcaTaT 

, £ „„ ii inn ii mi mn mn mm inn mm m mi i i 

16 2300 TAAcACAGGTAAATCATtaTTTGGtATGAG TTTAATGAAaTTTcTgCAAGGGTcTCTAATATqtTtT 

„ „„ m mi mn mn in m m i iiiii i i mm i T 

31 223 8 TAATACAGGTAAATCATATTTTGGAATGAGccTTATTGAgCTTTtTACAAGGATgTaTAATATCATaT 

, G mum 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 im mi mum iiiiiiiiii 

18 2371 aAATACAGGaAAATCATATTTTGGAATGAGttTTAT acaCTTTaTACAAGGAgcagTAATATCATtT 
con -aAtACaGG-AAaTCaTatTTTgGaATGAG-tTTAataaacTTTtTacaaGGatc-gT-ATat taT 
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6 2338 GTaAATTCCaGCAGCCATTTtTGGtTgCAaCCgtTAgtaGATGCtAAgGTaGCATTgTTaGATCATGC 

II llllll llllllllil III I II II II Mill II II MM! II 1 1 1 1 1 1 1 1 

XI 2338 GTtAATTCCtGCACCCATTTcTGGcTaCAGCCAcTAaCgGATGCAAAAGTgGCATTaTTgGATGATGC 

II 1 1 1 1 1 II II M Ml I MMM M I IIIIIMM I I I I I Mlllll 

33 2361 GTAAATTCTAAAAStCAcTTTTGCTTgCAGCCATTAtCAGATGCAAAAATAGGAATGaTAGATGATGt 

M 1 1 1 1 1 1 1 1 1 1 f i II llllllll II llllll Mlllll MIMIII III II M I II M 

16 2367 GTAAATTCTAAAAGc CATTTTTGGTTACAACC ATTAGC AG ATGC C AAAATAGG t ATGTTAG ATG ATGC 

I llllll Mill I II I ! i 11 1 1 ! 1 1 1 1 1 1 1 I II Mill llllllll MIMMMIIMI 

31 2306 GcAAATTCaAAAAGTCATTTTTGGTTACAACCAcTgGCtGATGCTAAAATAGGCATGTTAGATGATGC 

I Mill I IMIIIMIIMM MM I I III MM I I 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 

18 2438 GtgAATTCcActAGTCATTTTTGGTTggAACCgtTaaCaGATaCTAAggTgGcCATGTTAGATGATGC 
con GtaAATTCcaaaAG-CAtTTtTGGtT-cAaCCatTagcaGATgCtAAa-TaG-aaTg^TaGATGATGc 



6 2406 aACACAgCCATGTTGGAtATATATGGATACATATATGAGAAAttTgTTACATGGTAATCCTATGAGtA 

Mill MIIIIMM Mllllllllllllllllllllll I IIIIIIMIIIIIIIIIIII I 

11 2406 cACACAACCATGTTGGACATATATGGATACATATATGAGAAAccTaTTAGATGGTAATCCTATGAGcA 

II I I I f 1 1 1 1 f 1 1 1 1 i 1 1 III II llllllll MIMIII III II I 

33 2429 aACgCcAatAaGTTGGACATATATAGATGAtTAcATGAGAAATGCgTTAGATGGAAATgaAATTTCaA 

II llllll II llllllll I I llllllll If IIIIIMM I MM I 

16 2435 TACAgtGCCcTGTTGGAAcTAcATAGATGAcaAttTAAGAAATGCATTgGATGGAAATttAGTTTCTA 

1 1 1 1 III MMM I II Mill I I II 1 1 It i 1 1 1 I Mill II II I Ml 

31 2374 TACAACGCCaTGTTGGcAtTAtATAGAcAAtTAccTAcGAAATGCAcTAGATGGCAAcCCtGTATCTA 

II II I MMM II I II I II I Mlllll 1 1 1 1 1 1 1 1 1 i II II II 

18 2506 aACgACcaCgTGTTGGacaTActTtGAtAccTAtaTgaGAAATGCgtTAGATGGCAAtCCaaTAagTA 

con aACaccgccatGTTGGacaTAtaTaGAta— tAtaTgaGAAAtgc-tTaGATGG-AAtcc-aT tA 

JJ15-gttggacatatatNgatacNtatatgagaaatgcgttagatgg-JJ15 

6 2474 TtGAcAGAAAgCATAaAGCATTgACATTAATTAAaTGTCCACCtCTgCTaGTaACgTCcAAcATAGAt 

I II Mill I II I MMM IMMMIMI llllllll II II II II II II Mill 

11 2474 TAGATAGAAAACATAGAGCATTAACATTAATTAAgTGTCCACCgCTaCTgGTTACaTCAAATATAGAc 

Mill llllllll llllll I I II llllllll II II MM Mlllll I I 

33 2497 TAGATGTgAAACATAGgGCATTAGTgCAAtTAAAATGTCCACCACTgCTtcTTACcTCAAATAcAAAT 

I Mill II IMM MM II III Mlllll II III I I MM II II I III 

16 2 503 TgGATGTAAAGCATAGAcCATTgGTaCAAcTAAAATGcCCTCCATTATTaATTACATCTAAcATtAAT 

I II II II 1 1 III 1 1 II II III Mlllll Mill MM IIM Ml II Ml II III 

31 2442 TaGATGTAAAGCATAAAgCtTTAATgCAgtTAAAATGTCCTCCtTTATTgATTACATCTAATATAAAT 

I III llllll Ml I Mill II I I I I I I I I 1 I I I I II I I II I llllll II 
18 2574 TtGATagAAAGCAcAAAcCaTTAATaCAacTAAAATGTCCTCCaaTAcTacTaACcaCaAATATAcAT 

con TaGAt — aAAgCAtA-agCaTTaatacaa-TaAAaTGtCC-CCacTacTa-TtACatCaAAtAtaaAt 

6 254 2 ATTAct AAAGAaGAt AAATAt AAgTATTTACATAcTAGAGTa ACaACATTTACATTTCCAAATCCATT 

MM Mill II Mill II llllllllil llllll II lllllllllllllllllllllll 

11 2542 ATTAGCAAAGAgGAgAAATAcAAATATTTACAXAGTAGAGTtACcACATTTACATTTCCAAATCCATT 

III III I II IIIMIIMMIIIM I II II 1 1 IIM IIIIIMM 

33 2565 GCaGGCACAGAcTCTAGATGGCCATATTTACATAGTAGATTaACaGTATTTgaATTTaaAAATCCATT 

II II Mill Mill Mill MINIUM llllll II III MM III II 

16 2571 GCtGGTACAGATTCTAGgTGGCCtTATTTACATAaTAGATTGGTGGTgTTTACATTTCCtAATgagTT 

II MM Ml II Mill II llllll III Mlllll MIMMIMI III II 

31 2510 GCACGTAACGATgAcAGATGGCCATAccTACATAGc AGAcTGGTGGTtTTTACATTTCCAAATcCATT 

III llllll I MIMMIMI II I II III I II Ml I Ml I II II I MM 

18 264 2 cCAGcaAAGGATaAtAGATGGCCATAttTAgAaAGtAGAaTaacaGTaTTTgaATTTCCAAATgCATT 
con gc-ggtAaaGAtgatAgaTggccaTAttTAcAtAgtAGA-TaacagtaTTTacATTTccaAATccaTT 
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11 2610 G C C C TT*TG A GAG AAmA^TG G GAA.7G G A.G T A 7 A.7G T ATCA. G A.TGCJ^AA.CTG G A-A-A.i G "i^TT <r "riio AAA. 
33 2633 CC«TTTCAtGA^TGGtAAcCCACTCTATG 

1 1 1 1 1 ! 1 1 MTTI II II IlllllliM | Him | || llllillllllllilllll 

31 2578 G ^ GA.A«A.TG GA.G T A.T A."TG A-A 1 7 A^A. G A.7^^AA^A.iA.G ^^GG AA^A.7 GG "I^i^T^X^T C T^G^A-A. 
18 2710 TCCATTTGAtAAAAAtGGci^ 

con — CCaTTTGAcaaAAAtGG-AAtcCAGT-TATGaacTaaatgAtaaaAAcTGGAAAT— TTtTT AA 

6 2678 G ACTGTC GTC aAGC CTAG ACATTc AGG ATTC t G AGG A CGAGGAA GATGGAAGCAATAGCCAA 

„ ™ iiiiiiiiii iiiiiiiiiiii linn inn limn iiiiiiiiiiiuiiiii 

11 2678 GACTGTCGTCCACCCTAGACATTgAGGATTCAGAGGA CGAGGAA GATGGAAGCAATAGCCAA 

i 11 n ii mi i n hum i inn dim i mm i 

33 2701 GGACGTGGTgCAaATTAGATTTa taGAGGAAGAGGA CAAGGAAAACcATGCAGGaAATATCagc 

ir M M I II II II MM III Mill Mill II MUM II MIMI II 

15 2707 ^^^y^y C ^^^^^^^^^^^^^^^ c ^^^ CAAGGAAAACGATGGAGACTCTTTgcCA 
31 264 6 GGACGTGGTgCAGATTAAATTTGCACGAGGAAGAGGA C^AAGAAAACGATG^ 



INI 1 1 1 1 MIIMI MIIMIIIMMIIMM III I III III II Mill I 

18 2778 GGACaTGGTcCAGATTAgATTTGCACGAGGAAGAGGAagatgcAGAcAcCGAaGGAaACcCTTTCggA 

con GgacgTgGTccAgatTAgattTgcacGAggaaGAGGA— c-agGAaaacgAtGGA~ ca-T-tcc-a 

6 2740 <^ <3 "TTT-A.^S .A.*2T a ^» <^ <^ <3 JPlC^ JPL^a. <^A.<S ^^^^^S^^^T-A ^S.A-A. 'P^A^^^^S.A^A.^S J 1 ^-^^^^^^^^^^^ ^S^VjC ^ ^\^^^A.^^ 

11 2740 GCGTTTAGATGCGTGCCAGGAtCAGTTGTTAGAACTTTA^ 



„ m-mi mm MM I I I MM Mill MM II I II 1 1 III II 

33 2 /66 ACGTTT^TCCagtgC^^ 

16 2772 ACGTIT^TGTGTG 

31 2711 ACGTTTAAATGTGTGTCAGGACAAAATAtTAGAACA 



18 2846 ACGTTTAAgTt gcgt gCAGGACAAAATcaTAGAcCAcTATGAAAATGAcAGTAAAgacaTagacagcC 
con ACGTTTAaaTgcgtg-CAGGAcaAaaTatTAgaaC-tTAtGAA-atgA-AgtAc-gaccTacacaaaC 

6 2808 AtgTatTGCATTGGAAATGCATgaGAcatGAAAGTGTATTAtTAtAtAAAGCAAAACAAATGGGCCTa 



11 2808 AcATTaTGCATTG^ 

33 2834 AaATTGAaCATTK 

16 2840 ATATAGACTATTG^ 

31 2779 ATATAGACTATTGGAAACAtATtCGaCTW 

1B , aiJ 111 I HUM Ml II II Ml II I II I I I 1 1 1 1 i I Ml || Ml 

18 2914 AaATAcAgTATTGGcAACtaATaCGttggGAAaaTGcAaTAtTcTtTgcAGCAAGgGAAcatGGcATA 
con At aTagag-ATTGGaAAc-cATacGact5GAa-gTG-atTatt«tataaaGCaA-a-AAatgGG~Ta 
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6 2876 AGCCACATaGGaaTgCAAGTAGTgCCACCATTAAagGTGTCcGAagCaAAAGGACATAATCCcATTGA 

IIMIIII II I IIMIIII llllllllll Mill II I 1 1 1 1 1 II M II M I Mill 

11 2876 AGCCACATcGGgtTaCAAGTAGTaCCACCATTAACtGTGTCAGAGACtAAAGGACATAATGCtATTGA 

M I I II II II II I II I III Mil 1 1 1 1 I I I I Mill 

33 2902 tcACATtTatGCCACCAgGTGGTGCCttCtTTGttaGcATCAAAGACcAAACCATTTCAAGtAATTGA 

i it i i iiiiii iimiiii i ii i iimiiii iimiiii nil linn 

16 2908 aAACATATTAACCACCAaGTGGTGCCAaCacTGgCtGTATCAAAGAatAAAGCATTACAAGCAATTGA 

I 1 1 1 M 1 1 M 1 1 1 IIIMMM I II I IMMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 

31 2847 CAcAgTATTAACCACCAGGTGGTGCCAGCgTTGtCaGTATCAAAGgccAAAGCcTTACAAGCTATTGA 

II I I 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 I I Mill Mill llllllllll 

18 2982 CAgAcatTaAACCACCAGGTGGTGCCAGCcTataacaTtTCAAAaagtAAAGCacataAAGCTATTGA 

:on aaccataTaa-ccacCA-GTgGTgCCa-Cattgac-gtaTCaaAgactAAAGca — t-AaGctATTGA 

JJ18-tcaaagactaaagcacataaagctfattga 

6 2944 AATGCAAATGCATTTAGAATCaTTAttAAggACTgAGTATaGTaTGGAACCgTGGACATTACAaGAaA 

1 1 1 1 1 1 f 1 1 1 f I f 1 1 1 1 1 1 1 1 III II III Mill II IIIIIII IMMMIMI II I 

11 2944 AATGCAAATGCATTTAGAATCcTTAgcAAAAACTCAGTATgGTgTGGAACCtTGGACATTACAgGAcA 

i i iiiiii mil I in iii i iiiiii ii i iimiiii ii i i 

33 2970 ACTaCAAATGgCaTTAGAgACATTAagTAAATCACAGTATAGXAcaagcCAaTGGACAXTgCAAcA&A 

Ml III I I Mill Ml II III lllll IIIIIII I IIMIIII III I 

16 2976 ACTgCAAcTaACGTTAGAAAGAaTAtATAACTCACAATATAGTAATGAaaAgTGGACATTaCAAgAcg 

mi m i i iii iiiiii ii mil i mm i mil i mill 1 n i 

31 2915 ACTaC AAATG At G TT g G AAAC A tT Aa AT AAC AC Tg AAT AC AAAAAT G AG G Ac TG G AC AaTG CAg c Aa A 

mi Mini i ii i ii i m m i ii iiiii mm mm i i 

18 3050 ACTgCAAATGgcccTacAAggccTtgcacAaAgTcgATACAAAAccGAGGAtTGGACAcTGCAagAcA 

AcTgCAAaTg-c-tTagAaacatTa aaaactca-TAtagta — gaaca-TGGACAtT-CAagA-a 

actgcaaatgg-JJ18 



con 



6 3012 CaAGTTATGAAATGTGGCaAACACCACC tAAACGcTGtTTTAAAAAACgGGGcAAaACTGTaGAaGT 

i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 mimii iiiii n mimiimi m n inn m n 

11 3012 CcAGTTATG AAATG TG G C T AAC AC CAC C CAAACGgTGcTTTAAAAAACAGGGAAAtACTGTgGAgGT 

I M I II IIIIIII III I I II IIIIIIIIMI III I M II II 

33 3038 CaAGCTTaGAgGTGTGGCTttgTGaACCACC A a aAT GTTTTAAAAAACAaGG Ag AaAC AG T a a c t G T 

Ml I II III I I II Mil I I MM I 1 1 1 1 1 f ! I Ml I IIIII II 

16 3044 ttAGCCTTGAAGTGTATTTAACTGCACCAAC AGGATGTaTAAAAAAACATGGATATACAGTgGAaGT 

„ . m iiiiii m 1 1 1 1 m it 1 1 1 n i ii iii iii mmimiiimiii II II II 

2983 C AAG t C TTG AAC TGTATT T AACTG CACCTAC AGGgTGTTTAAAAAAACATGGATATACtGTAGAgGT 

ii i mil i ii i mm mmm m i n m i n 

CAtGcgagGAACTaTggaatACaGaACCTACtcactgcTTTAAAAAA ggTGGccAaACaGTAcAaGT 
caaG-t-tGAa-TgTggctaac-gcACCaacaa-g-tgttT-AAAAAacatGGa-A-AC-GTagaaGT 



31 



18 3118 



11 



33 



3079 tAAATTTGA 

MINIM 

3079 aAAATTTGA 

Ml III 

3105 GCAATaTGA 

III I Ml 
16 3111 GCAgTTTGATGG 

Ml Ml I II | | 
31 3050 GCAaTTTGATGG 

i mm 



TGGCTGTGcAaACAATacAATGGAtTATGTCGTATGGACAgATgTgTAtgTGCAGG 

in mil i iiiii iiiiii mi iiiii ii ii iii ii i ii mm 

TG G CTG TG AAg AC AAT g t AATG G Ag T ATG TG GTATG G ACAc AT AT AT Ac CTG CAG G 

ii i iii mm m mi i iiiii i i 

caatGACAAAaAaAATACAATGGATTATACAAACTGGggtgAaATATATATTataG 

MM I lllllllll 1 1 1 1 1 1 1 1 f 1 1 1 1 1 I IMMMM I 

aGACATAtgCAATACAATGCATTATACAAACTGGAcAcATATATATATTTGTG 

II M III II IIIIIIIIMI IIIIIII I IIIIII I III 

tGAtGTAcACAAcACcATGCATTATACtAACTGGAaAtTTATATAccTaTGTa 
• ^ ' ' ' " I 



------ iiiiiiiii ii i ii ii i I i ill l 

18 3185 atAtTTTGATGGcaacaAaGacaAttgtAtgAcctATgtagCatgggacAgtgTgtatTAtaTgacTg 

on gcAaTtTGAtggcaacgatgaaaacaatacaAtggAttat-caaactggacagatataTAtaTg — tg 
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6 3144 ACAAtGACaCcTCGGTAAAgGTgcaTAGTatgGTAGATGCtAAGGGtATATACTAcACATGTGGACAA 

mi iii i iiiiiiii ii mi mum m hum iminniii 

11 3144 ACAAcGACtCAiGGCTA^ 

i i ii mi i ii ii i nun i ii ii 1 1 1 1 i 

33 3170 AgGAA^t»CATCtACTATGGTWCaGGgaAAGTAGATTATAtaGGTATgTAriATATA<^TaActGt 

i mi n in mi ii mi mm I in i i 

in i I nni 1 1 n ii mm ' mum miimmi 

31 3115 tAGAtGCccaATCtAcl^^ 

I II III I I I I I I III I III I I I I I I I I I I I I 
18 3253 atGcaGGaacATGggacaaaaccGctacctgtGTaAgTeacAgGGGatTgTATTATGTAaAgGAAGGg 

eon a-gaaGacaeatgg-cta-ggt-g-t-gt-aaGTagattataagCGtaTaTATTAt-tacatgaagga 
6 3212 IIIAAAACATATTATGTAAAcTTTgtaAAAGAGGCAgAAAAGTATGGgM 

1 1 ii i ii 1 1 1 1 ii ii in 1 1 ill ill minimi 

31 33.3 c*,Jl=iiiiyiiJ.J^^i>^^ 

II II I II I II I I | | III III III I I IIIIIIII 

18 3321 tAcAacACgTtTTaTaTAgAaTTTAaAagtGAatgtgAAAAATATGGGAacacaggtAcgTGGGAAGT 
con t-taaaacaTaTT-TgtaaAtTTTaaa-aaGAggcagaAAA-TATgg-Aa-ac-aaaaa-TGGGAAGT 
. 33.0 = CC«C = - 

33 330. iwicoiiTCOTCCOIUT M™"™;™ ,TC«TATCTMC» ACM UU 
16 3312 tCATGCGGGTGGTCAGGTAAT atk«K«ACATCTGTgTTTACCAGCA ACGA AgTAT 

mi nun iii i mm I I in nil Mil I Ml 

31 3251 gCATGCGGGTCGTCAGGTAATTG TTTtTCCTgaATCTGTaTTTAGCAG TGACGA AaTAT 
III II I I I II III I I Mill III III III 

18 3389 aCAitttGGgaai-aAtGTAATTGattgTaaTgactctatgtgcagTAcCAG TGACGAcacggTAT 

acaT--.GGt-<j t -agGTaATtg-at-tt-Tcct g catc-tct-t--c.AGcactgac-a a gaagTAT 



AC 

II 
AC 



6 3342 CCATTcCTGAA tCTACTACATACACCCCCGCACAGACC tCcaCCCT tCTGTCCtCaaGC AC 

33 33.3 iiiiWUli c^i44Jii4U^iUiiUUi.4,ciU4.„iUiUJ«i 1 1 

3,33.3^^11^,^1 ,111111 »C - 

l l l l l l l l l l l l ll ill l ' 1 l l 

16 3371 ccTcicCKAiitiATiaggcagci cttgGCC ^CcACcCCgCCgCGACCcATAC 

III II ill III 1 1 1 1 1 1 1 1 1 I I III 

31 3310 CCi tTGCTggGATTGTTAcAaAGCTACcaacaGCC AACaAC AC Ca CC aCAt CG a AT TC 

ii i hi 1 1 1 1 1 1 i Mini I inn ii ii ii mi 

18 3454 CC gckcicaiciiGTTUcAGCTAC AgcACACCcCCtCAcCGtATTC 

con CCact-cTgaaa ttgacatacAcccacgcacagacc— c— caacaac-cctcc-Caacc-ataC 

CC GCXACXCAGCXXG-DE2 1 
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6 3403 CA aGGAAGACCcaCTCcaaaCGCCGCCTAGGAAaCG 

U 3406 cic 11111,11 111 IIIIIMMIIII II 

11 3406 CAC CGAACACGGcGTGtcggCGCCGCCTACGAAgCG 

IN III D 

33 3410 CACAAGC agcggccAAACgacGACGAC cTgCAGacACCA 

" '"I mi linn i iii mi 

16 3426 CA AAGCCgtCGCCTTGGGCACC GAAG AAACacaGACGAC TATccAgCCACCAAGA 

nil ii in mi ii in i mi mm n iniiimii 

31 3368 CA AAACCtgCGCCTTG^ 

ii in II llllllll I III I III) llllll 
18 3503 CA gcACCgtgtCCgTGGGCACC GcaAAGaccTaCGGC caGACGTC 

BE10-GGCGXGXCGGCGCCGCCXAG-BE10 



con 



18 
con 



CAcaaaccgtcgccttgggcacc-g-gaaggcgtacgaagac-gacgacgtcc-cc-ag— accaaca 
6 3439 AGCACGaggagtccaACaGTCCcCTtgCAACgCCtTGTGTGTGGCCcACATt gGAcCCGTGGACAGTa 

,i llllll ii mi H mi ii lllllllllli III! II lllllllllli 

11 3442 AGCACG tggACCGTCCaCTaaCAACaCCcTGTGTGTGGCCaACATCaGAtCCGTGGACAGTA 
33 3449 CAGACACCGCCcAgCCCCT t acaaAgcTGTTc tGTGCA gaCccCgCCtTGGACAaTA 

hiiiiih linn I | mi I I llllll I 

16 3481 tCAGAGCCACACACCG GAAACCCCTgCCACaCCActAAGTTGTTGCAcaGaGACTCaGTGGACAGTG 

Niiiiiin i inn n im m 1 1 i 1 1 1 1 1 i i i inn niimm 

31 3435 aCAGACCCAGAGcaC aGAAACaCCCaCCACcCCAacAAGTTGTTGCGAgGCGACTCcGTGGACAGTG 

354a I , J I J. M HI I I I I || II I llllll II 

3548 G 9CtgctAcACgaCCtggACaCtgtggAcTcGcgGaGAaGCagCattGTGGAC cTG 

acaaagccaga— ccgc-aaaCccct-c-acaccatgt-tttggtgcacagcggctccgTGGACagTg 

6 3507 gAAaCcACAACcTCaTCACTaAC AATcACgACcAGCACCAAA GAcGG AACAACaG 

n MA ii i inn ii inn ii in ii ii iiimiii ii ii mm i 

11 3504 CAAtCaACAACATCGTCACTgAC AATtACAACAAGCACCAAA CAaGG AACAACTG 

33 3506 gAACAgcaCgtACTGCAACTAACtGC aCAAACAAGCAgCGGA cTgtGTGT AgTTc 

16 3548 cTCCAaTcCtcACTGCAttTAACAGCT CACACAAAggACGGA tiaaCTGT AaTag 

31 3502 TCAACTgtggggTTaTCaGTGCAGCT gcatgCACAAAccAAACAA GGgCTGTCaGTtGTcc 

. _ , CAJ II] I J IIIIIIIIIIII Mill II llllll 

18 3604 TCAAC ccacTTcTCgGTGCAGCTacacctacaggcaacAACAAaagacGGaaacTCtGTaGTgg 

con c— aac-c— cactgc-actaaCagctaat-c-aacaagcacca-Aa gggtgtcaaca-t-g 

6 3562 TaAC AGTt CAGCTACGCCTATAGTGCAAtT tCAAGGTG AaTCCAATTGTTTAAAqTGTTTTAGATATA 

„ , cea 1 Mill IIIIIMIIIIIinillll I MIM, II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIIIIIIIIMI 

U 3559 TcACAGTGCAGCTACGCCTATAGTGCAAcTgCAAGGTGAtTCCAATTCT 

3 i i' iiiii! i 1 111 1111 iiiiiiii i iiiii i m ii i iii 1 1 ii i ii ii 1 1 

31 3563 TgcaACTACA CCTATAATACAcTTAAAAGGTGATGCAAATAtaTTAAAATCTTTAAGATA 
18 Ml* I iiiii I I I" I I I "I lllllllllli III I IIIIIIIIIIII I II 

18 3668 TaacACTAC gCCTATAATACAtTTAAAAGGTGAcagAAAcAgtTTAAAATGTTTAcGgTA 

con TaacacTaCagctacgCCtATAgT-CAttTaaAAGGTGAttcaAAtagtTTAAAaTGTTTaaGaTAta 

JJ20-cattcaaaaggtgaNtcNaatagtttaaaatgtttaagatata 
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6 


3630 


11 


3627 


33 


3621 


16 


3663 


31 


3623 


19 


3728 


con 





GgCTaAATGACAgAcAcAgACATTTaTTTGAtTTAatATCaTCAACCTGCCAcTGGGCCTCctCaaAG 

i M mini i i i mill inn m m minimi iiiimi i n 

CaCTgAATGACAaATATAAACATTTGTTTGAaTTAgcATCtTCAACGTGGCATTGGGCCTCacCtgAG 

*™ i _ mm i mi ii ii n ii iiimm n i 

cAGATTaAAAcctTATAAAgAgTTGTATAgTTCtaTGTCaTCcACcTGGCATTGGACCaGtgAcAAc 

mn Ml II IIIIMI I I MM || || ! 1 1 1 1 1 1 i 1 1 1 I | || 

TAGATT tAAAAAgc AT t g t a c ATTGTAT AcTg C AGTG TC gTCTAC ATG GC ATTGG AC Ag G a cA tAAT 

m i in it iiiimi mill nimmiiimm i n 

TAGg cTG t cAAAAtATAaaeAATTGTATgaAcAAGTGTCATCTACATGCCATTGGAC At GtacagAT 

n ii mi m i in i i i mn n iiniimm 

cAGatTGcgAAAAcATAgcgAccacTATagAgAtaTaTCATCcACcTGGCATTGGACA ggtgc 

g-agattt-aaaaa-Ata-aca-ttgTaT-a-t-a-t-TCaTC-AC-TGGCAtTGGaC-tg-cc-aa- 
gcagatt-JJ20 

6 3698 GCACCACATAAA C ATGCcATTGTAAC t gTAACAT ATga t AGTGAGGAACAAaG gCAaCAgTTTT 

miiiiiiiii mi mmii mm n iiiiimim i n n mi 

11 3695 G CACC ACATAAAA ATGCAATTGTAACatTAACAT ATAgcAGTGAGGAACAACGtCAgCAaTTTT 

i mn in 1 1 1 1 1 1 1 1 1 mm i ii mm i mi 

33 3688 a aAAAt ag TAAAA ATG gAATTGTAACt gTAACAT tTGtaAcTGAAcAGCAACAAC AaatgTTTT 

m mn ii mm ii i mi n i mi mn i mi 

16 3730 GtAAAACATAAAA gTGCAATTGTtACacTtACATATGatAGTGAATgGCAAC GtGACcAaTTTT 

„ , COA i nimimi m mn ii i mm m n in i m i mi 

31 3690 G GAAAACAT AAAAA TGCtATTGTaACctTAACATATataAGTacATCACAA AGAGACgAtTTTT 

1G i n mn i ii i ii mmi iii i mn m i mi 

18 3791 aGgcAAtgaAAAAAcaGgaATacTgACtgTAACATAccatAGTgaAaCACAA AGAacaaAaTTTT 
con g-a-aacatAAAaaatGcaATtgTaACtgTaACATatgatagt-aa-agcAAcaaag— aacaaTTTT 

6 3762 TAgAtgtTGTAAAAATACCcCCtACCATTAGcCA CAAacTGGGaTTTATGTCAcTgCACcTATTGTA 

„ „„ ii i illinium n minii n n mi miiim i n mini 

11 3759 TAaAcAgTGTAAAAATACCACCcACCATTAGgCAT AAGgTGGGgTTTATGTCAtTaCATTTATTGTA 

„ , ii i 1 1 1 1 1 1 1 1 m 1 1 1 ii ii m ' i i mm 

33 3752 TAggTAccCTAAAAATACCACC tACTgTgcaAAT AAG TACTGGATTTAT 

„ „ n i i ii iiimm m i ii 1 1 1 1 1 1 1 1 1 1 1 1 

16 3794 Tg t c Tc aaGT tAAAATACC A AAaACTaTtaCAGT GTC TACTGGATTTAT 

„ , i i ii mmii ii n i mi in mn 

31 3754 TAAATACTGTaAAAATACC tAAcACagTatCAGT GTCaacaggatatatgactATTTA 

I 1 I 1 1 1 I I I I 
18 3856 TAAATACTGT 

con TaaatactGTaaaaataccaccaaaca-tagcaat-aaggtcgg-tttatgt-actg-atttattgta 
6 3829 AtttgtatatatgtaaAtgtgTaaATATATGgTATtgGTGTAatacaActgTACaTGTATGGAaGTgG 

^ c i i i ii iii ii iii mn i m mi m i ii i 

11 3826 A cCATTAcaccTgtATATATG TATAtGTGTA CATAACATACgTGTATGGAgGTaG 

mn ii mn m iii i mi 

33 3801 G aCATTA TAaGTGTA CATcACAagCcaaTATG 

16 3843 G tCt 

31 3812 - g C C 

18 3866 tgcaattccagatagtgtacaaatattggtgggataCa 
con a g~. catta 1— atatatggtatatgtgta— cataacaaacatgtatggaagtcg 
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6 3897 TGCCTGTACAAATaGCTGCAGgAACAACcAgcACATTcATAcT GCCTGTTaTaATTGCAT 

llillllllllll Illllll I I I I I I I Hill III I Illllll I Illllll 

11 3881 TGCCTGTACAAATTGCTGCAGcAACAACTAcAACATTGATATT GCCTGTTGTTATTGCAT 

llilll I II III Illllll I I I I I I | 

33 3833 TGCTGC TaA t T g tAT A tAAC CATG AT ATT t g TTTTTG taTTaTGTTTTaTaTT 

i iiiiii i Mini n in i i 

16 3847 atATGA caAAtcTtgatacTgcATccAcAaCATTAcTGgcgTGCTTTTTG CTTTGCTTT GTGTG 

MM II I I Mill II 1 1 till III 1 1 III III I Mill 

31 3815 TaATGATtgAActaAatattTcTAcagtaAgCATT gTGCtaTGCTTTTTG CTTTGCTTTTGTGTG 

I I II II I I II I II II Mill llillllllllll 

18 3904 TgAcaATgtAAtacAtatgcTgTAgtaccAatATgttatCacTtaTTTTTttatTTTGCTTTTGTGT 

con tgac-atacaa-ttgctgc-tgaacaaccA-cA — tt-ata-TgctttttggccTtt-cTtttgtgtt 

02 1 -CTGCAGGAACAACCAGCACATTCATACT GCCTGTTATAATTGCAT 

6 3957 TTGttGTATGTtTTgTTAGcATcaTACTTATtgTATggATATCTGAgTTTaTtGTgTAcACATCTGTG 

mi iiiiii ii mi ii iiiiiii in iiiimi in i ii ii iiiiinii 

11 3941 TTGcaGTATGTaTTcTTAGtATtgTACTTATaaTATTAATATCTGAtTTTnTaGTaTATACATCTGTG 

ill i i ii i iiiiii i ii in i i n 

33 3886 gT TTTtaTGctTaTcCtTATTAtTACGTCCtTTAATAcTTTCcaTTTCTACcTATgCtTggTTG 

i ill ii i i i inn iiiiiii i i i ii i inn iiiiii 

16 3911 CT TTT GTGTG TCTgCcTATTAATACGTCCgCTgcTttTGTCTGTGTCTACATAcaCATCATTA 

ii hi mimi i nmni ii i minim m n n ii 

31 3880 CTactaTTT GTGTGTCT tgTcATACGTCCaCTtgTgcTGTCTGTGTCggtATAtgCAaCAcTA 
18 3971 

con -tgctgtttg-tgtgt-tgcat— ta-tacgtccatt-atattttct-tttctgtatatacatc~ tg 

021-TTGTTGTATGTTTTGTTAGCATCATACTTATTGTATGGATATCTGAGTTTATTGTGTACACATCTGTG-021 

6 4025 CTaGTACTAACACTgCTTTTATATTTaCTaTTGTGGCTgcTATTAACAACCCCcTT GCAATTtTTcc 

it milium minimi n minim mimiimi n mm n 

11 4009 CTGGTACTAACACTTCTTTTATATTTGCTTTTGTGGCTttTATTAACAACCCCTTT GCAATTcTTTT 

inn i i i i i mini i i mil mi mi 

33 3950 CTGGT gTTGGTATTgcTgcTtTggGtgTTTGTGG gAtCtCCTTTaaaAATT TTTT 

I I llllllll " III II 

16 3974 aTAaT ATTGGTATT acTaTTgTGGaTAACaGCAgCCTCTgCgTTTaG 

in iii i i i ii iii i i m inn i ii i 

31 3943 cTAtT ATTaaTtgT gaTtTTaTGGgTtAttGCAaCCTCTcCaTTacG 
18 3971 

con ctagtac-tt-attttttttatatttgcttttgtggcttttatgaa-aac-cc-ttc-caattttt— 

02 1-CTAGTACTAACACTGCTTTTATATTTACTATTGTGGCTGCTATTAACAACCCCCTT GCAATTTTTCC -02 1 
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6 4092 TACTAACtCTactTGTGTCtTAcTgTCCcGCaTTgTATATACACtacTAtATTCT taccacA 

1 1 1 1 1 1 1 ii nun n mii ii imi mi in ii urn i 

11 4076 TACTAACaCTgtgTGTCTGCTATTTTCCTGCcTTTTATATACACataTACATTGTgcaaacgcAAcaA 

i mum i mum n n n 

33 4005 T — tTGCTAXTT GTTg t TTTTATAT tTA CCAaTGaTgTgtATtAATtt 

i inn m mi ii m i ii ii ii 

16 4021 gTGTTTtaTTGTATATATtaTATTT CTT TATA TA CCATTATTTTTAATaCATAC 

iiiii mmmi i i m im u imiim im urn 

31 3990 tTGTTT TTGTATATAT gtTgT GTT TATA TAtattCCATTATTTgTAATtCATAC 

18 3971 

con tactaacactgtatat-tgctattttcgttc-ttttatatactata-tccattgtttttaat-cata- 

02 1 -TACTAACTCTACTTGTGTGTTACTGTCCCGCATTGTATATACACTACTATATTGT TACCACA-021 

6 4154 cAgcaaTGATGcTAACaTGTCAaTTtAATGATGGaGAT ACcTGGcTGggTtTGTCGTTCTTatgTG 

i mn mi iiiii ii mum m n in n i mmmi ii 

11 4144 TAATG gTGATG tT AAC cTGTC ACTTaAATG ATGG t G AT ACATGGtTGTtTcTGTGGTTGTTtacTG 

I III I II II I I Ml II Mill III I II I II 

33 4051 TCATGCACAgcaTAtgacacaACaagAgTAATGTATAT ACATGtaTaTATTGTTtGTATATAtgTG 

mini i mmmi mi mm inn 

16 4075 ACATGCAC gcTTTTTAAttacaTAATGTATATGTACATAaTgtaATTGTT ACATATA 

iiiii ii mini ii i i i ii ii ii 

20 31 4044 ACATGCA tctTTTTTAA GTCAAcAgTaaCTTTTTT AC 

I I II Mill I 

18 3971 AtgcatgtatgtgtgctCcCAtgtcccgCTTTTgccAtctgtctgta 



10 



15 



25 



35 



con -catgcacatg-taac-t-t-Aattaaataatggagatgtacatggttg-tTtt-tg-t-t-tatgtg 

021* CAG CAATG ATG CT AACATG TCAATTTAAT G ATG G AG AT ACCTGGCTGGGTTTGTGGTTGTTATGTG-02 1 



6 4220 CcTTTaTTGTAGggaTgtTgGGgTTaTTaTT gaTgCAcTAtAGaGCTGTACAaGGggaTaAAc 

I III llllll I I II II II II I II II II f I ! I I I 1 I II I II 
11 4 210 C ATTT gTTGT AGcT G T acT t GG ATTG TTGT T acTaCATTAcAG gGCT GTAC ATGG TacTgAAA 

II II I III I Mlllllll I II IIIII 
33 4117 CA caTGGTGgTGTtTTAacATTGTTGTT gTTATTTT AgtTTTTTTTTTTTTGTA 

30 i ii ii in mini i n mm i mi 

16 4132 atTgTTGTATACcaTaActtactaTtTTttctTtTTTATTTTcaTatAtaaTTTTTTTTTTTGT 

I mini i n in i i n i i 

31 4081 TtGTGTATAC TgTTgtTTgTatTgGTattggTaTTggTaTTgq 

II I I I I II i 

18 4018 tgtgtgcGTaTgcAtgggtattggtatttgtgtatatTgtggTaataacGTcccctgccacagcaTtc 



con cattt-tt-gtg-a-t-ttag— tt-tt-tt-t-tt-ttt-ttt— a— g-t-t-tttttt-ttt-t- 

02 1-CCTTTATTGTAGGGATGTTCGGGTTATTATT GATGCACTATAGACCTGTACAAGGCGATAAAC-02 1 



6 4 283 Ac AC cAAATGTa a g AAGTGTAA CAAAC aCAActgTAaTGatGATTATGTaacTATGcattATacT 

i ii iiiiii imim mn m ii ii mnm mi n i 

4Q 11 4273 AaACTAAATGTgctAAGTGTAAATCAAACcgCAAtacTAcTGTgGATTATGTGTATATGtcacATgg? 

mn nm i in mm 

33 4171 ttACTAA TAAAT AccTTTATaTtttagcaGTGTAT 

I llll 
16 4196 TTgTTTgtTTgTTTTTTA 

nm ii mini 

4 c 31 4124 tattggTATtggTaTaaTaaacTTTTTTaCTTtTTTTTTA 

III | I I III I I II I 
18 4086 acagtaTATgtaTtTtgTttttTaTTgccCaTgTTacTattgcatatacatgctatattgtctttaca 

con a-actaaatgtattaagtgtaatt-t — cc-t— tttT-atgttgattaagtgtatatg tatact 

02 1 -ACACCAAATGTAAGAAGTGTAA CAAAC AC AAC TG T AAT G ATG ATT ATGTAACT ATG C ATT AT ACT -O 2 1 
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4164 
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con 





i mi nun 

iTaATGGa GATT ATG 

nun 

tATJATG 



aTaTAtATCAAtTAGAGTAAACCgTTTTTTATAtttgtaacaGTGTAtGc 

i ii inn mmnm iimmi mil i 

TgTAcATGAAcTAGAGTAAACC TTTTTTATAcagtgtgtgGTGTAcGt 



• ■ • ataaactgTTATTA 

mm 

TTATTA 
II 

8 4154 gtaattgtataggttgttttatacagtgtattgtacattgtatattttgttttataccttttaTgcTt 

g-taatggagattatg tatacatgaa-tagagtaaacc~ttttttatatt-ttaat-gt-tatt- 

02 1-ACTGATGGTGATTAT ATATATATGAATTAGAGTAAACCGTTTTTTATATTTGTAACAGTGTATGC-02 1 
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6 4413 TttgTATAccATggcacAtagTAGGGCcCGacGACGcAAgCGTGCGTCAGCtACACAgCTATATCAAA 

i mi ii i mm n mi n minimi inn minium 

11 4405 Tag t TATA tATAatgAAACcTAGGGCACGcaGACGtAAaCGTGCGTCAGCcACACAACTATATCAAA 

i i mi in i i nm ii n mmnm mi 

33 4213 AGACACAAACGaTCTaCAAGgCGCA AgCGTGCATCtGCaACACAACTATAcCAAA 

minim ill ill mi i iimm n n nm n in 

16 4228 CttaacaATGCGACACAAACGtTCTgCAAAACGCACaAAACGTGCATCgGCTACcCAACTtTATaAAA 

i nm mm m iniiimi mmn n nm in i m in 

31 4170 C cATGCGgtcCAAACGcTCTaCAAAACGCACTAAACGTGCgTCtGCTACaCAATTaTATcAAA 

I I III I I I I I I 

18 4222 tttgtattTttGtaatAAAaGtatggtAtccCaCcgTgccgcacgacgcaaacgggctTcggtaactg 

con tttgtatat-aga-acaAacgt-c-gcaagacgc-gtaaacgtgc-tc-gctacacaactatatcaaa 

021- TTTGTATACC ATG G C AC ATAGTAGG GCC CG ACG AC G C AAG CGTG CGTCAGCTAC ACAG CTATAT C AAA- 021 

6 44 81 C ATG tAAac t C AC TG G aACATGc CCCCCAG ATGTAATTC C T AAg GTgG AgC Ac Aac AC c ATTG C AG AT 

mi n mm nm mmmnnmim n n n i n iimmi 

11 4472 CATGCAAGGCCACTGGtACATGtCCCCCAGATGTAATTCCTAAAGTtGAACAtAcTACtATTGCAGAT 

I n i i i it 1 1 1 1 1 n ii n ii ii nm mmnm m i in n mm 

33 4268 CATGCAAGGCCACAGGc ACc TGcCCACCcGATGTTATTCCTAAAGTgGAAGGaAgTACcATaGCAGAT 

mini mi ii ii nm n mi nm n nm i n n n n 

16 4296 CATGCAAAcagGCAGGTACaTGTCCACCtGACaTTATACCTAAgGTtGAAGCcAaaACtATTGCtGAa 

mi ill mmn mm i in iiniiiiii i in i n inn n 

31 4233 CATGtAAAgcAGCAGGTACtTGTCCAtCaGACgTTATACCTAAaaTaGAACaTActACcATTGCaGAC 

.... i i i ii m i i mi i i i i ii i i 

18 4290 acTtatAtaaAaCAtGTAaacaatCtggtacatgTccACCTgAtgTtGttCcTAaggtggagGgcacC 

con caTgcaAagccaCagGtAcatgtcCaccagatgttat-CCTaAagTtGaacatAataccattGcagat 

021-CATGTAAACTCACTGGAACATGCCCCCCAGATGTAATTCCTAAGGTGGAGCACAACACCATTGCAGAT-021 
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6 4549 CAAATATTAAAATGCGGAAGtlTgGGGCTgTTTTTTGGaCGGTTgCCTATaCCcACgGGttCcGGcAC 

„ C . A iiiiiiiiiiiiiiiiiiii n inn iiiiiiii iiiii urn nun i n i 

11 4540 CAAATATT AAAATGCGG AAGcTTAGCG GTTTTTTTTG GTGGG TTAGGTATTGG t ACAG G g gCTC G T Ag 

„ iiiii i im it ii 1 1 1 1 1 1 1 1 ] 1 1 1 1 1 1 1 1 1 1 1 iiiiiiiiiii iiiii inn 

33 4336 CAAATtcTtAAATATGGcAGTTTAGGGGTTTTTTTTGGTGGtTTAGGTATTGGcACAGGcTCTGGTtC 

« A ,c* 1'iiLJ. 1,11111 111 1 11 H null um inn him nm n m i 

16 4364 CAAATATTAcAATATGGaAGTATGGGTGTaTTTTTTGGTGGGTTAGGaATTGGaACAGGgTCgGGTAC 

„ „ ft1 IHIMII I Hill lllllllllll llllll llllllll II IIIII. I II II lllll 
31 4301 CAAATATTAaggTATGGtAGTATGGGTGTtTTTTTTGGTGGGTTgGGtATTGGgtCcGGCTCtGGTAC 

„ CD ii I i i inn ii in m iiiii 

18 4 35 8 acgtTAgcAgataAaatattgcaatGgtcaagccTTGGTataTTtttgggTGGacttGGCataGGTAC 

con caaaTattaaaatatggaagttt-gGggttttttTTGGtgggTTaggtattGG-acaGGctctGGtac 

02 1 -CAAATATTAAAATGGGGAAGTTTGGCGGTGTTTTTTGGAGGGTTGGGTATAGGCACGGGTTCCGGCAC--02 1 

6 4617 TGGgGGTCGTaCtGGcTATgTtCCCTTacaAActTCTgCaAAaCCTtCTATTACTaGtGGGCCtatgG 

„ Ae in nun i ii in i inn n in i n in iiiiiiii i inn i 

11 4608 TGGcGGTCGTgCaGGgTATaTACCCTTgGGAAgcTCTCCcAAgCCTGCTATTACTgGgGGGCCagCaG 

mi i i ii in mi i ii i n i in i ii i 

33 4404 AGGtGGAaGgACTGGcTATgTACCtaTtGGtACtgacCCaCCtACAGCTgCAAtccCctTGCagCCTa 

m m i mil in in i ii ii ii ii mm n i i in 

16 4432 AGGcGGACGCACTGGgTATaTtCCatTgGGaACAaGgCCTCCcACAGCTaCAGAtaCAcTTgctCCTg 

n ii iiiiiiii iii i ii i i m i iii i mi i n ii i in 

31 4369 TGGgGGTCGCACTGGaTATgTcCCtcTtaGtACACGtCCTtCTACAGtaTCtGAggCAagTaTaCCTa 

m II I ii n I I ii i ii i ii. I I 

18 4426 TGGcaGTgCtACaGGgggTcgt aCagggtacAttCcattgggTgggcgtTCcaAt aCAgtggTg gaTg 

con tGGcgGtcGtaCtGGgtaTgttcC-ttgggaAct-ctcc — ctacagctactaatacag-gcc-cctg 

BE 1 1 -GAAGCXCXCCC AAGCCXGCXAX X-BE 1 1 
BE 1 2 -T ATATACCCTTGGGAAGCXCXCCCAAGCC XGCXAX -BE 1 2 
02 1 -TGGGGGTCGTACTGGCTATGTTCCCTTACAAACTTCTGCAAAACCTTCTATTACTAGTGGGCCTATGG-02 1 

6 4685 CtCGTCCtCCtGTGgTgGTGGAGCCTGTgGCCCCTTCgGATCCaTCtATTGTGTCtTTAATTGAaGAa 

i iiiii u m i milium mum nm u mum iiiiiiii ii 

11 4676 CACGTCCgCCaGTGcTTGTGGAGCCTGTTGCCCCTTCcGATCCcTCcATTGTGTCcTTAATTGAgGAg 

mm ii ii m ii nun mi u i u u mil inn u u 

33 4 472 TACGTCCtCCgGTtACTGTAGAcaCTGTTGGaCCTTtaGActCgTCTATAGTGTCaTTAATaGAAGAA 

ii i u ii i ii inn mi ii mi ii i mum u m i mm 

16 4500 TAAGACCcCCttTaACaGTAGAtCCTGTgGGcCCTTctGAtCCtTCTATAGTtTCTTTAGTgGAAGAA 

i iiiii it i i i ii iiiii n ii i ii ii iiiiiiii i i ii nun 

31 4437 TTAGACCaCCAgTTAgCaTtGAcCCTGTaGGtCCcTTGGACCCCTCTATAGTAagTcTTGTTGAAGAA 

io aaqm ii i ii u i i i i i i ii in im i i i iiiii i i 

18 4494 TTgGtCCtaCAcgTccCccaGtggtTaTtGaaCCtgTGGgCCCCaCagacccAtcTaTTGTTacAttA 

con t-cGtCCtcCagttac-gtaGagccTgTtGgcCCtt-gGa-cCctCtatagtgtcttTa-Ttgaagaa 
BE26-CGXCCXCCGGXXACXGXAGAXA-BE26 J J 2 2 - TCT AT TG TG T C NTT AATN G AAG AA 
BE27-GXCCXCCCGXXACXGXAGACACX-BE27 022 -GGATCCATCTATTGTGTCTTT AATTG AAGAA 
BE28-XCCXCCGGXXACXGXAGACACXGXXGGACCXXXAG-3E28 
021-CTCGTCCTCCTGTGGTGGTGGAGCCTGTGGCCCCTTCGGATCC-O21 
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6 4753 TCgGCaATcATTAAcCCaGGgGCgCC TGAaaTtGTgCCCCC TgCACAcGGTGGgTTTAC 

" " " ""I II II II II ill I II IIIII inn HIM iiiii 

11 4744 TC t G CTATT ATTAAtGC tGGTG CACC TCAggTgCTaCCCCC iAcIclg^TGicwiic 

I I 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 I |i HIM || ||( 

33 4540 ACaAGTTTTATaGAgGCaGGTGCACCA CccCCaTCtATTCC TACACcATCAGGtTTTga 

„ 11 iimiiii ii ii iiiiiiiii • inn inn mm hi 9 

16 4568 ^^^'^''^^^^^ATGCTGGTGCACCAaCatCTGtaCCtTCcATTCCcCCagatgtATCAGGaTTTag 

iiiii nun nun i ii i ii ii hi ii in 

31 4505 tCTGGaaTTgTTGATGTTGGTGC CCCTGCTCCTAtaCCacacCCTCCTacaACATCTGGGTTTGA 

1B ' in i i i iii mi n i ii n minimi 

IB 4562 ataGaggactccagTGTgGtTaCatcaggTGCaCCTAggCCtacgttTaCTggcACgTCTGGGTTTGA 

con -ctggtattatt-atGctGgtgCacca-ctgctgc-at~ c~ cccctcct-caccatctGGgTTT-a 

TCTAGTNTTATTAATGCAGGTGCACC-JJ22 

BE5-CAXXAACGCAGGGGCGCC XGAA-BE5 

BE 6 - GGCAAXCAXXAACGC AGGGGCG-BE 6 

BE7-GCAAXCAXXAACGCAGGGGCGCC XGAAAXXGXGCC-BE7 

OS-GTACCCCC TAC AC AGGGTG G CTTT AC 
02 2 -TCGGCAATCATTAACCCAGGGGCGCC TGAAATTGTGCCCCC TGCACACGGTGGGTTTAC -02 2 

6 4812 aAT t AC ATC c TCTGAAaCa ACTAC c C C TGCaATaTT g G ATG T aTCaGTT AC tAqTC AC ACTA 

ii « M Jd I 1111 1111,1 1 11111 11111 H tl UNI II III II I IIIIIIII 

11 4803 TATaACATCATCTGAAtCgACTACACCTGCtATTTTaGATGT gTCTGTT ACcAaTCACACTA 

„ AttQO i i H mm i minimi m i mi mm i i 

33 4599 TgTTACTACATCTGCAGATACTACACCTGCaATTATTAATGTttcaTCTGTTg gggAgtca tCTATTc 

„ J, iNNii ii i iiiii mum mn i i i n n 

16 4636 TATTACTACtTCaaCtGATACcACACCTGC tATATTA GATa 1 1 aAt Aa tACTGTTA 

«™ ill 1111 1 11111 111,1111 11 111 in i i mn 

31 4570 cATTgCTACaaCtGCaGAcACaACACCTGC aATTTTA GAT gtaACaAaTGTTA 

. q ,„ ft ''J, i ii ii mmii in I ii 

18 4630 tATaaCatCtgCgGgtacaACtACACCTGCggtTTTggatatcacaccttcgtctacctCtgtgtcTA 

con taTtaC—CatCtgcag— ACtACaCCTGCaatttTt-atgt—catctgtt— tac-act— ta-Ta 

05 -TATAACATCATCTGAATCGACTACACCTGCTATTTTAG ATGT GTCTGTT ACCAATCACACTA- 05 
02 2 -AATTACATCCTCTG AAACAACTACCCCTGCAATATTGGATGT ATCAGTT ACT AGTCAC AC T A - 0 2 2 

6 4874 CtACTA GTaTaTTTagAAATCCtgTcTTTACAGAACCtTCTGTAAcACAaCCCCAACCACCcGTG 

u 4865 XJLU1 iLLUi iiiUUL 1 i ■ i ■ 1 1 1 1 1 1 ■ mini in immmi in 

11 4865 Cc ^CTA GTgTgTTTcaA^TCCCcTgTTTACAGAACCgTCTGTAATACAgCCCCAACCACCTGTG 
33 4667 aaACTATTtCTACACATttAAATCCCACaTTTACTGAACCATC^ 

u 4692 cJLUJi llllllll * i'lU' 11 11 "I" iiiiiiiii i ii mm i mm 

16 4 692 C tACTgTTaCTACACATaAt AATCCCACTTTCACTGACCCATCTGTATTGCAGCCTCCAACACCTGCA 

3i 4 623 Jlllll i 11111 11111 11111 mmiimmmm iiiiiiiii 

J 1 gCACACATgAaAATCCTACTTTTACTGATCCATCTGTATTGCAGCCTCC tACACCTGCA 

1« ... 11 1 "I HI I HI I I I I I I I II I I I III | III 

18 4698 tttccacaacCAatttTaccAATCCTgCaTTTtCTGATCCgTCcaTtaTtgAagtTCCacaAaCTGgg 

con cta ctatta-taca-^TaaaAATCC-ac-TTtaCtGAaCCaTCtgTaatacAgcctCcaccacCtGc- 

05-CCACTA GTGTGTTTCAAAATCCCCTGTTTACAGAACCGTCTGTAATACAGCCCCAACCACCTGTG-05 
022-CTACTA GTATATTTAGAAATCCTGTCTTTACAGAACCTTCTGTAACACAACCCCAACCACCCGTG-022 

027-CTGCA 
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6 4939 CAGCCtAaTCCaCAtATAtTaATtTCTCCaCCcACtcTAACgTCaCAcCcTaTAGAgGAaATTCCttT 
Mill I Ml II III | || Mill II II III! I I I II I UN II Mill I 
11 4930 GAGGCCAgTGGtCAcATAcTtATaTCTGCCCCaACaaTAACaTCcCAACaTgTAGAAGACATTCCAcT 

II HI HI II I | I III Hit II I I Mil I Ml III! Ml I 
33 4735 GAAGCCtcTGGaCATTTTATAtTTTCTTCCCCtACTgTTAGcACACAAAgTTATGAAaACATaCCAAT 

III I II lllllll I MM II I III llll IIIII I I I I I II I I II II || 
16 4760 GAAACtggAGGgCATTTTAcACTTTCATCATCcACTATTAGtACACATAATTATGAAGAAATtCCTAT 

inn in inn Miiiimm mini imiiiimm inn inn 

31 4682 GAAAC a t C AGGTCATTT Ac TACTTTCATC ATCa t CT ATTAGc AC ACATAATTATGAG G AAAT AC CTAT 

it iiiii iiiiiii iiiii mm mi ii iiiiim n i 

18 4766 GAggtggCAGGTaATgTAtTtgTTggtaCCCCtaCatcTgGaACACATgggTATGAGGAAATACCTtT 

con GA-gc — c-GGtcAttTa-ta-TttcttC-cC-aCtattag-aCaCAtaattatCA-gAaAT-CCtaT 

05-GAGGCCAGTGGTCACATACTTATATCTGCCCCAACAATAACATCCCAACATGTAGAAGACATTCCACT-05 
02 2 -GAGGCTAATGGACATATATTAATTTCTGCACCCACTGTAACGTCACACCCTATAGAGGAAATTCCTTT-02 2 
02 7 -GAAGCCTCTGGACATTTTATATTTTCTTCCCCTACTGTTAGCACACAAAGTTATGAAAACATACCAAT-02 7 

6 5007 AGAtACTTTTGTgGTATCaTCTAGTGATAGcGGtCCTACATCCAGTACcCCTgTTCCTgGTaCTgcaC 

iii 1 1 m 1 1 it iiiii minimi it Minimum m nm n n 1 

11 4998 AGAcACTTTTGTTGTATCCTCTAGTGATAGTGGaCCTACATCCAGTACtCCTcTTCCTcGTgCTtttC 

ii it mum m i iiii mini ii n mi 1 n i 

33 4803 GGATACcTTTGTTGTTTCCACAgACa gTAGTAatGTAACATCaAGC ACgCCCATTCCAGGGTCTCGCC 

mm mi nm mi n n i nm mil nm iimmmii 

16 4828 GGATACATTTATTGTTagCACAAACccTAAcAcaGTAACtAGTAGCACACCCATaCCAGGGTCTCGCC 

i mi i ii i ! i nun ii ii in im i 1 1 1 1 1 mi 1 1 1 1 mm mi 

31 4750 GGATACATTTATTGTTTCTACTAAtaaTGAaAAcaTAACaAGTAGCACACCCATtCCAGGGGTGCGCC 

i mm i mi n n i i nm n n i n nm i 

18 4834 acAaACATTTgcTtcTTCTggTAcggggGAggAacccAttAGTAGtACcCCatTgCCtactGTGCGgC 

con -gAtACaTTTgttgtttccactaatgata— aac-aAca — tAG-AC-CCcaTtCC-gg-gctcgcC 

05-AGACACTTTTGTTGTATCCTCTAGTGATAGTGGACCTACATCCAGTACTCCTCTTCCTCGTCCTTTTC-05 
022-AGATACTTTTGTGGTATCATCTAGTGATAGCGGTCCTACATCCAGTACCCCTGTTCCTGGTACTGCAC-O22 

6 5075 CTCGGCCTCG tGTGGGccTaTATAGTCGTGCaTTgCAcC AGGTgCAGGTTACaGACCC tGCaTTTcTt 

mmmi nm i miimm n n inn muni nm n in i 

11 5066 CTCGCCCTCGgGTGGGTTTgTATAGTCGTGCcTTaCAgCAGGTACAGGTTACgCACCCcGCgTTTTTg 

ii i i ii i nm imim i 1 1 1 1 1 1 1 1 i i 

33 4871 CTCTGCCACGCCTtGGTTTATATAGTCG CAAt ACc C AACAGGTTA AGGTTGTtGACCCTGC 

i iimimn ii 1 1 1 1 1 1 1 1 1 1 1 ii ii immm i nm mum 

16 4896 CaGTGGCACGCCTAGGaTTATATAGTCG CACaAC AC AACAGGTTA AAGTTGTaGACCCTGC 

iiiii iiii iimmi i imim ii i nm inn i 

31 4818 GTcctGCACGTtTAGGgTTATATAGT AaGG C t AC AC AAC AAGT a A AAGTTaTTGAtCCaaC 

ii ill n i i ii ill i m mum m i i n 

IB 4902 GTgtaGCAgGTccccGccTtTAcAGT AgGGCctacCAACAAGT gtcAGTggcTaAcCCtga 

con ct-tggCacGtct-gG-tTaTAtAGTcgtgc-atg a — caaCAgGTtaca-gttgttga-cctgc 

05-CTCGGCCTCGGGTGGGTTTGTATAGTCGTGCCTTACAGCAGGTACAGGTTACGGACCCCGCGTTTTTG-05 
022-CTCGGCCTCGTGTGGGCCTATATAGTCGTGCATTGCACCAGGTGCAGGTTACAGACCCTGCATTTCTT-02 2 
02 7 -CTGTGGCACGCCTTGGTTTATATAGTCG CAATACCC AACAGGTTA AGGTTGTTGACCCTGC -02 7 
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6 5143 TCCACtCCtCAaCGcTTaaTtACaTAT CAtAACCCTGTaTATGAA CCgGAqGATG 

i, JJULiJL Ii I] 11 11 1 11 111 11 "linn nun n n im 

11 5134 TCCACgCCaCAgCGaTTggTAACTTAT GAcAACCCTGTcTATGAA GGaGAaGATG 

„ „„ J „ i mi i ii i ii ii mi i mi i in 

33 4932 TTTTtTAACatCgCCTcaTAAACTTATaACATATGATAATCCTCCATtTGAAaGctTtGAccctGAaG 

, e mi mi i n iiiiiiiii niimiiniiiinn nn i i n n 

16 4957 TTTTgTAACcaCTCCcAcTAAACTTATTACATATGATAATCCTGCATATGAAgCTaTAGATGtgGAta 

„ JiLLi 1111 1 1 11 minimi n mn mm i n in n 

31 4879 GTTTCTTAgtgCTCCAAaacAgCTAATTACATATGAaAACCCTGCcTATGAAacTgTAaATGCtGAaG 

ilium mi iiimiiimi inn ii i 

18 4963 GTTTCTTAc a cgTCCAtcc tc t t TAATTACATATGAc AACCC gGCctttG 

con ttttct-accactcctta-taacttATtacatatGAtAAcCCtgcatatgaaagt-taga-gc-gatg 

05- TCC ACGC CACAGCG ATTGG TAACTTAT GACAACCCTGTCTATGAA GGAGAAGATG-05 

02 2 -TCCACTCCTCAACGCTTAATTACATAT GATAACCCTGTATATGAA GGGG AGGATG- 022 

027-TTTTTTAACATCGCCTCATAAACTTATAACATATGATAATCCTGCATTTGAAAGCTTTGACCCTGAAG-02 7 

6 5198 TtAGTgTACAATTTAgtCATGAtTCTA TaCACAATGCACCTGATGAgGCtTTTATGGAcATa 

i in iiiiiiiii inn im i imiiiimiiiiii ii iiiiini ii 

11 5189 TaAC "TTACAATTTAccCATGAgTCTA TcCACAATGCACCTGATGAaCCaTTTATGGATATT 

„ Knnn i mi in ii i in inn i 

33 5000 Ac AC ATTAC AATTT c a aC ATAg T GAT A TatcaccTGCTCCTGATCCTGACTTTcTaGATATT 

, £ e „ e i 1 1 1 1 1 1 i in ii inn i iiiii imimuii i mn 

16 5025 AtACATTATAtTTTTCtagTAaTGATAatagtaTTAATATAGCTCCaGATCCTGACTTTtTgGATATa 

„ AM i i mn mn m mmiii u mn mm i mn 

31 4947 AatCTTTATACTTTTC caATAcatCgCaTAATATAGCcCCTGATCCcGACTTTcTaGATATT 

la Rn „ i ii i ii i i i i i ii i i mm i ii m i mm 

18 5013 AgcCTgTggACacTaCattaacattTgatcCtCgTAgTgatGttCCTGATtCaGAtTTTaTgGATATT 

con «-acttTacAattTac-cataattaTaat-ctcttaataatGctCCtGATcc-GacTTTaTgGAtATt 

OS -TAAGTTT AC AATTT ACCCATGAGTCTA TCCACAATGCACCTGATGAAGCATTTATGGATATT-05 
022 -TTAGTGTACAATTTAGTCATGATTCTA TAC ACAATGC ACCTGATGAGGCTTTTATGGACATA-0 2 2 

02 7 - ACACATTACAATTTCAACATAGTGATA . TATCACCTGCTCCTGATCCTGACTTTCTAGATATT-02 7 

6 5260 ATTcGttTgCAcAGACCtGCcATtgCGTCCcGACG tGGcCTTGTGCGgTacAGTCGCATTGGaCAACG 

» ».i iU L LJ.LUU 1 11 11 11111 1111 11 iiiiini i minimi mn . 

11 5251 ATTaGAcTACATAGACCaGCTATaACGTCCaGACGgGGTCTTGTGCGTTTTAGTCGCATTGGgCAACG 

33 5flfi5 ILLiiUUU Nil III II II 

33 5062 ATTGCATTACATAGGCCtGCTATtACaTCTcGtaGacaTacTGTGCGTTTTAGTAGAgTaGGTCAAAA 

„ mi mm i Mint n i n in. ill in mini in i n 

16 5093 gTTGCtTTACATAGGCCaGCatTaACCTCTaGgcGtAcTggcaTTAGgTAcAGTAGAaTtGGTAATAA 



immnmi i i ii im ii mm i niimi 

- * -r v ^ « 4 ws-a. x * AT AGGCCTGCc cTt AC CTC ac G t aG g Aac AC TG T T AGaTAT AGT AG AcTAG G T AATAA 

1fl enoi ii iiiiiiniiiii i n ii i i mm i i n i m 1 1 mn i 

18 5081 ATccgtcTACATAGGCCTGCttTaACaTCcaGgcGtgggACTGTTcGcTtTAGTAGAtTAGGTcAacg 



aTtg— tTaCAtAGgCCtGCtaT-aC-TCc-G-cGtggtactgT-cG-T-tAGTaGaaT-GGtcAa— 
JJ24-TACATAGGCCTGCTATAACNTCCAGNCGTGGTNNTGTGCGNTTTAGTAGA-JJ24 
05-ATTAGACTACATAGACCAGCTATAACGTCCAGACGGGGTCTTGTGCGTTTTAGTCGCATTGGGCAACG-05 

01 6-ACTGTGCGTTTTAGTAGAGTAGGTCAAAA-01 6 
022-ATTCGTTTGCACAGACCTGCCATTGCGTCCCGACGTGGCCTTGTGCGGTACAGTCGCATTGGACAACG-022 
027-ATTCCATTACATAGGCCTGCTATTACATCTCGTAGACATACTGTCCGTTTTAGTAGAGTAGGTCAAAA-027 

02 B - GTAG AC ATACTG TG CGTTTT ACTAG AGT AGG TCAAAA 
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6 532 B GGGCTCtATGcACACtCGCAGcCGAaAgCAcATAGGgGCCCGCATtCATTATT 

linn in mi inn m i n inn mum mini 

11 5319 GGGGTCcATGtACACaCGCACTCGAcAACAtATAGCtCCCCGCATACATTATT 

i i i i ii mum mi n n n i iimimi 

5 33 5130 AGc c ACACT t aAaACTC GCAGTGG tAAAGAaAT tGG aGCT AG aAT ACATTATT AT c Ag G ATTT AAGTc 

i mil Tim inn m n n mi iimmm i mnim 

I $ 5161 ACAAAC ACT a CG tACTC GT AG TGG aAAA t CT AT aGGTG CT A a GG TACATT ATTATT ATGATTT AAGT A 

nmi i ii mini mi tin inn i m mini mm in mi 

31 5077 ACAAACTtTGCGcACTCGTAGTGGTgCtaCTATtGGTGCaAGGGTgCATTATTATTATGATATtAGTA 

mi ii n ii n m i ii mn nm n i mi mini in 

10 18 5149 ggcAACTaTGtttACcCGcAGcGGTaCacaaATaGGTCCtAGGGTtCAcTtTTATcATGATATaAGTc 

con -g-aaCtaTgcacACtCCcAGtGG-aaacatATaCGtGCtagg-TaCAtTaTTatcatgatataagta 
05-GGGCTCCATGTACACACGCAGTGGACAACATATAGGTGCCCCCATACATTATT( -05 ) 
016 - AGCCACACTTAAAACTCGCAGTGGTAAACAAATTGGAGCTAGAATACATTATTATCAGG ATTTAAGTC-0 1 6 
022-GGGGTCTATGCACACTCGCAGCGGAAAGCACATAGGGGCCCGCATTCATTATT( -022 ) 
75 02 7 - AGCCACACTTAAAACTCGCAGTGGTAAACAAATT GGAGCTAG AATACATTATTATCAGGATTTAAGTC-02 7 

028 - AGCCACACTTAAAACTCGCAGTGGTAAACAAATTGGAGCTAG AATACATTATTATCAGGATOTAAGTC-02 8 

6 5381 TTtAtGAtATTTCACCtaTTgCACAgGCTGCAGAaGAAATAGAAaTGCACC rCTtGTGG 

ii i ii iimimi ii mi mmii iimiiii imiiiiii mi 

on 11 5372 TTCAgGAcATTTCACCagTTaCACAAGCTGCAGAgCAAATAGAAcTGCACCCTCTAGTGG 

I I III I I II II III I II 

33 5198 CTATTG TgcCtttAGAcCACaccgTgCcAAATgaACAAtaTgAATtAcAgcCTttaCaTgAtacT 

nmi i m i i ii i i i i i i ii i i 

16 5229 CTATTGATCCTGCAGaAGAAAtagaatTACaAAcTatAacAccTtCtaCAtAtACTACcACTTCacaT 

mi immii mi i i i i nm mn i 

« 31 5145 gTATTaATCCTGCAGgtGAAAgTATTGAAaTGCAaCCTTTAGgggCgTCTGCaACTACtACTTCtacT 

mi m i in mini mi mini mn i 

18 5217 cTATTgcaCCTtCcccaGAAtaTATTGAAcTGCAgCCTTTAG taTCTGC caCggag 

con ctattgatc-t-c agaacacat — taca-aagct-caa — g-aatc — aa-ctaccc — teg 

( 05- ) TTCAGGACATTTCACCAGTTACACAAGCTGCAG-05 

01 6 -CTATTG TGCCTTTAGACCACACCGTGCCAAATGAACAATATGAATTACAGCCTTTACATG ATACT-0 1 6 
022- TTTATGATATTTCACCTATTGCACAGGCTGCAG-022 

02 7 -CTATTG TGCCTTTAGACCACACCGTGCCAAATGAACAATATGAATTACAGCCTTTACATGATACT-02 7 
02 8 -CTATTG TGC CTTT AG AC C ACACCG TGC C AAAT G AAC AAT ATG AATT AC AG CC TTT A C ATG AT AC T- O 2 8 

6 54 41 CTGCAcAggATGAtACaTTTGATATTTATGCTGAAtCtTTTGAaCCTggCatTaACCCTacCCAACAc 

mn i mi ii miimimmm i mn in i i nm mm 

II 5432 CTGC Ag Aa aATG Ac AC GTTTG AT ATT T ATG C TGAAcC ATTTG AcCCTatCcc T g ACCC T g t CCAACAT 

II I I I II I Ml 

33 5263 tC t aC aTC g tCT t aTaGTATTAATg ATGG tTTgTATGATgTTTATGC TgaCgAtGT 

i i ii ii i iiiiiii mi ii nmi mini i i 

16 5297 gCAgCcTCacCTacTTcTATTAATaATCGA TTaTATGATaTTTATGCaGATgacttTattACAGA 

I I I I I III I llll II II III I III I 

31 5213 ttAAATGAtggCTTaTaTGAcATTTATGCAGA CAcTGATtTTaCTGTGGATacACCTgCcACAcA 

mn iii i iii ii mmii n n i i m i m 1 

18 5273 gacAATGA CTTgTtTGAtATaTATGCAGAtgaCAtgGAcccTgCaGTGccTgtACCatCgcgttC 

con -t-aa — a-a tat-T-ttAt-taTg-acag-ac-atgatatt — tgc taccctt-ccaa 

45 Ol 6 -TC TAC AT C GTC TT AT AGTATT AATG ATGG TTTGTATGATGTTTATGC TGACGATGT- 0 1 6 

027-TCTACATCGTCTTATAGTATTAATGATGG TTTGTATGATGTTTATGC TGACGATGT -02 7 

02 8 -TC TAC ATCGTCTTATAGTATTAATG ATGG TTTGTATGATGTTTATGC TG AC G ATG T -0 2 8 
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6 550$ cCTGrTACAaatatatcagAtaCaTArtTaACtTCCACACCTAATACagTT&CACAAcCCTGCGCTAA 

mum i i mi i it Minimum n mim miiimiii 

11 5500 tCTGTTACA c Ag t C t TATCT tAC c TCC ACAC CTAATAC CcTTtC ACAA t CGTGGGGTAA 

I I Ml I I I I I I II 

33 5319 ggaTaaTgtAcaCaCcccAAtgCaacaCTCATacAgtaCgTTtgCAaCaacaCgTACcaGcAATCTgt 

MIM I I I I I II II II III 

16 5362 TAcTtcTaCAaCCcCggtAcCatctgtacCCTCtACatCTTTaTCAGgtTATaTTCCTgCAAATACaA 

|| I I I I I I II I I II I II I Ml I III HUM! I 

31 5278 TAaTgtTTCCcCTtCtacTgCTgtAcAgTCCaCatCTgCTgTgTCTGCCTATgTaCCTACAAATACcA 

Ml III I I I I I I II I II MM I II I II III I 

18 5338 TAcTaccTCCttTgCattTttTaaAtAtTCgcCcaCTatatctTCTGCCTcTtcctaTAgtAATgtaA 

con ta-tttt-ca tctc — attca tcacctac — c-ttatcagcct-tc-ca-tagtaatgtaa 

016 -GGATAATGTACACACCCCAATGCAACACTCATACAGTACGTTTGCAACAACACGTACCAGCAATGTGT-0 1 6 

027 -GGATAATGTACACACCCCAATGCAACACTCATACAGTACGTTTGCAACAACACGTACCAGCAATGTGT-02 7 

028 -GGATAATGTACACACCCCAATGCAACACTCATACAGTACGTTTGCAAGAACACGTACCAGCAATGTGT-02 8 

"i 5577 cACCACAGTtCCATTGTCAcTtCCTAaTGACCtGTTTtTaCAaTCTGGcCCTGAtATAACTTTTCCTA 

MMMM lllllllll I MM MM MM I II Mill Mill MMIIIMMM 

11 5559 tACCACAGTcCCATTGTCAATcCCTAgTGACTGCTTTGTgCAGTCTGGgCCTGACATAACTTTTCCTA 

M I II II III I III III I I I I HI I 

33 5387 CtATACCTTTaaATacaGgATttqATAcTCCTGTTaTGTCtGGcCCTGATATACCtTccCcTTT AtTT 

I II Mill I I M Ml MM I I II II I M 1 1 1 M 1 1 1 I II I 

16 5430 CaATtCCTTTtgGTggtGcATacaATATTCCTtTAgTaTCaGGtCCTGATATACCcATtaATaTAaCT 

i i ii i ii ii i inn ii i it ii nun mi n i i 

31 5346 CtGTgCCacTAAGTaCaGgTTttGAcATTCCcaTATTtTCtGGgCCTGATgTACCtATagAgCATgCA 

i ii ii in i - ii ii i ii in i ii linn i i mi 

18 5406 CgGTcCCttTAAcctCctcTTggGAtgTgCCtgTATacaCgGGtCCTGAT AttacattAcCATctA 

con ca-taCctttaaatt-tg-aTtcgatat-cCtgt-tt-tc-ggtCctGat-taccataacattt-cta 

0 1 6-CTATACCTTTAAATACAGGATTTGATACTCCTGTTATGTCTGGCCCTGATATACCTTCCCCTTTATTT-016 
02 7-CTATACCTTTAAATACAGGATTTGATACTCCTGTTATGTCTGGCCCTGATATACCTTCCCCTTTATTT-027 
028 -CTATACCTTTAAATACAGGATTTGATACTCCTGTTATGTCTGGCCCTGATATACCTTCCCCTTTATTT-02 8 

6 5645 CTGCACCTATGGG AACACCCTTTAGTCCTGTAACTCCTGCTTTACCTAC AGGCCCTGTTTTcATTAC A 

Mill MMIMMIMMMMMIMIMIMMMMMMMMMMMMMII MMM 

11 5627 CTGCAtCTATGGGAAC ACCCTTTAGTCCTGTAACTCCTGCTTTACCTACAGGCCCTGTTTTTATTACA 

I I | MM I I Ml III Ml IN II M I 

33 5455 CcCacAtCTaGccCATTtgT TCCTATttCgCCTttTTTTCCTtttGACACcATTgTTGTaGAc 

i i ii mi i nun ii i m i n m m ii 

16 5498 gaCcaAgCTccTTCATTAaT TCCTATaCttCCagggTCTCCACAAtAtACaATTaTTGcTGAT 

i ii ii mi i i ii i mm i in m 1 1 i 

31 5414 CctaCACaGgtTTtCCCATT TCCTtTGGCCCCTaCaaCgC CAC AAg t G TC TATT t TTGTTG AT 

I I I I I MIMI 1 1 IMMM I I MM II MM I I M 

18 5472 CtacCtCtGtaTggCCCATTgtatcaCCcacGGCCCCTgCctCtaCACA GTaTATTggTaTacAT 

con ct-c-act-tgtg-ac-a-ttttagtCCtatagctCCtgctt-tcC-caag-c-ctaTTttt-ttgat 
BE22-CCCAXXGXAXCACCCACGGCCC-BE22 
BE23-CCCAXXGXAXCACCCACGCCCCCXGCCXCXACACA-BE23 
01 6 -CCCACATCTAGCCCATTTGT TCCTATTTCGCCTTTTTTTCCTTTTGACACCATTGTTGTAGAC-01 6 

027 -CCCACATCTAGCCCATTTGT TCCTATTTCGCCTTTTTTTCCTTTTGACACCATTGTTGTAGAC-02 7 

02 8 -CCCACATCTAGCCCATTTGT TCCTATTTCGCCTTTTTTTCCTTTTGACACCATTGTTGTAGAC-028 
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16 5561 
31 5477 
18 5537 



6 5713 ^yy^J^9 a WtTATTTGCATCCTgCATCCTAtTTTGCACCtAaACGCCGTAAACGTATTCCCtTATT 

33 5518 Gckiiili^iUJi 11111 1 1 " 111 1 I III IIIMI UN I II 
33 5518 CCTGCTCACrogtTTTACATCCra 

' yillll" Ml llllllll II Mill I 1 1 1 llllllll llllllll 

CcaGGTGACTTTTATTTACATCCTAGTTATTAcATGTTACGAaaaCGACGTAAACGTTTACCATATTT 

i mil iiiinii ii minimi mm i mimmii n mini 

GGgGGTGATTTTTATTTGCAcCCTAGTTATTATATGTTAaaAcgtCGACGTAAACGTGTAtCATATTT 

ii m mini n mm i i mmiimi i inn 

GGtacacATTaTTATTTGtggCCattaTATTATtTtaTtcctaagaaACGTAAACGTGTtcCcTATTT 

con GgtgctgacTtttaTTTgcatCCtag-TatTat-Ttttacgta-acgaCGTAAACGT-TtcC-TatTT 

JJ25-CGTAAACGTNTTCCCTATTT 

• MmB PCR2-CCTTTTCCATATTT 

"27- GGTG CTGACTTTGTTTTACATCC TAGTT ATTTT ATTTTACG TCG CAGGCGTAAACGTTTTCC AT ATTT -027 

6 5781 TTTTtCAGATGT GGCGGCCTAGCGACAGCACAGTATATGTGCCTCCTCCtAACCCTGTATCC 

ii 57« liUJiiJiiiil JLJLJL ' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i i 1 1 1 f 1 1 mimmii 

11 5763 TTTTACAGATGT CCCGGCCTAGCCACAGCACAGTATAT'."7GCCTCCTCCcAACCCTGTATCC 

„ 1 11111111 minim ii mm n mum mum 

33 5586 TTTTACAGATGTCcgTgTGGCGGCCTAGTGAGGCCACAGTgTACcTGCCTCCT GTaCCTGTATCT 

u R „ a iim immi i mi immmimi u m mum u u mm 

16 5629 TTTTtCAGATGTCTCTtTGGCtGCCTAGTGAGGCCACTGTCTACTTGCCTCCT GTCCCAGTATCT 

,i ill! minimi mi mm urn miiiiuu u in iiimu m 

31 5545 TTTTaCAGATGTCTCTGTGCCGGCCTAGcGACGCTACTGTCTACTTaCCACCT CTCCCAGTGTCT 

. - „„ mi mm ii uiiummi ii in ii ii i mm i m i 

18 5605 TTTTgCAGATGgCTtTGTGGCGGCCTAGtCAcaaTACcGTaTAtcTtCCACCT CCttCtGTGgCa 

con TTTTaCAGATGtctctgtGGCgGCCTAG-GA— ccACaGTaTA— TgCCtCCTcc-gtccCtGTatCt 

TTTTNC AG ATG TCTN TGTG G CG GC CT AGTG A- J J 2 5 
PC R 1 - C AGATGTCTC TGTG G C G GCCTAGTG - PCR 1 
TTTTGC AG ATG - PCR 2 

02 7-TTTTACAGATGTCCGTGTGGCGGCCTAGTGAGGCCACAGTGTACCTGCCTCCT GTACCTGTATCT- 02 7 



6 5843 AAaGTTGTTGCCACGGATGCtTATGTTActCGCACCAACATATTTTATCATGCCAGCAGTTCTAGACT 

ii iiiiiiiniiiiiiii mini iiimmmmnmmmimiimiii 

AAgGTTGTTGCCACGGATGCgTATGTTAaaCGCACCAACATATTTTATCATCCCAGCAGTTCTAGACT 

ii mil in mi inn mil i in i mi mi i urn mil 

AAaGTTGTcAGCACtGATGAATATGTgtCtCGCACAAgCATtTATTATtATGCtGGtAGTTCCAGACT 

1 1 urn mil milium i mull m mm mi u i mum 

AAgGTTGTAAGCACGGATGAATATGTtgCACGCACAAACATATATTATCATGCAGGaAcaTCCAGACT 

ii Muiiummiimim mi u umiuium mu i i u u 

AAAGTTG T AAG C AC GGATG AATATGT a AC AC G a AC C AAC AT AT ATT ATCA C GC AG GCAGtgCTAGaCT 

i UilUii 11 11111 11111 111 mi mu mm u mu mi i 

AgAGTTGTAAatACcGATGAtTATGTgACtCccACaAgCATATtTTATCAtGCtGGCAGctCTAGatT 



11 5825 
33 5651 
16 5694 
31 5610 
18 5670 



con AaaGTTGTaagcACgGATGaaTATGTtac-CgcAC-AaCATaT-TTATcAtGC-gGcAgttCtAGacT 
JJ2 6-GTTGTNANCACGGATGANTATGTTACTCGCACAA- JJ26 
PCR3-AAGTTGTAAGCACCGATGAATATGT-PCR3 
LCR1A-AAGTTGTAAGCACGGATGAATATGT-LCR1A 

LCR1B-TGCACGCACAAACATATATTATCA-LCRB 
LCR1B ' -ACGTGCGTGrrTGrATArAArACrA-LCRB ' 
02 7 - AAAGTTGTC AGCACTGATGAATATGTGTCTCGCACAAGC ATTTATTATTATGCTGGTAGTTC CAG ACT-02 7 
6 5911 tCTTGCaGTGGGACATCCtTATTttTCcATaAAA cggGcTAA C AAAA CtGTTGTgC 
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Iim lllllllllll IIII II II III I Ml 

11 S893 CCTTGCTGTGGGACATCCATATTacTCTATcAAAAA aGtTAA 

, , llllllll II 1 1 f I i 1 1 1 1 1 Mill lllll III 

33 5719 tCTTGCTGTTGGcCATCCATATTTTTCTATTAAAAAtCCTAcTAA 

„ IIMI MIIU lllll llllllllll IIII II 

16 5762 aCTTCCAGTTGGaCATCCcTATTTTcCTATTAAAAAACCTAACAAT 

, Ml I M I II Mill MM I I II III II Mill II IIM II I II I 

31 5678 gCTTACAGTaCGcCATCCATATTaTtCcATacctAAAtCTGACAATccTAAaAAAA TAGTTGIaC 
II I 1 1 I I I f I I I III II I'll I I I I 



con 



I MM I Mill I 

C AAAA CAGTTGTAC 

I MM I I MM 

CgctAAAAAAtTATTgGTAC 

IIM MM II I 

AAcAAAA TATTaGTtC 



18 5738 atTaACtGTtGGtaATCCATATT 



TtagggttcCTCcaggTggTggcAAtAagcagGaTaTtC 



-cTtgC-GTtGGacATCCaTATTtttctaTtaaaaaacctgctaat- 
J J 2 7 - GTTGGACATC CAT ATTTT - J J 2 7 
02 7 - TC TTGC TGTTGGCCATC CAT ATTTTTCTATTAAAAATC CTACT AA 



-caacaaaAaa-tagttgTaC 
CGCTAAAAAATTATTG G TAC -027 



6 5967 CAAAGGTGTCaGGATATCAATAcAGgGTaTTTAAGGTgGTGTTaCCAGATCCTAACAAaTTTGCATTo 
llllllllll lllllllllll II II llllllll lllll llllllllllllll llllllll 

11 5949 CAAAGGTGTCtGGATATCAATATAGaGTgTTTAAGGTaGTGTTgCCAGATCCTAACAAgTTTGCATTa 
I H II II II I llllllll II IIII III II lllllllllll II IIII III 

33 5784 CcAAAGTATCAGGcTTgCAATATAGGGTtTTTAGGGTcCgTTTACCAGATCCTAATAAaTTTGGATTT 

ir c _ I lllllllllll II lllll lllll lllll I I Illlll II II Mill Mill III 

16 5824 CtAAAGTATCAGGATTACAATAcAGGGTATTTAGaaTaCaTTTACCtGAcCCcAATAAgTTTGGtTTT 

„ I H M llllllllllllll lllllllllll I I Illlll II I I II II lllll 111 

31 5743 CaAAGGTgTCAGGATTACAATATAGGGTATTTAGGGTtCgTTTACCaGAtCCAAAcAAATTTGGaTTT 

1D CDAA I HIM II I II llllllll lllllllllll I lllll II Mill illlllll II 

18 5800 CtAAGGTtTCtGcATacCAATATAGaGTATTTAGGGTgCagTTACC tGAcCCAAAtAAATTTGGtTTa 

con CaAAgGTgTCaGgaT — CAATAtAGgGTaTTTAgggT-c — TTa C C aG A t C C t AA - AAa TTTGga TT t 

J J 2 8 -C AAT ATAG GGTATTTAG G GTNC N GTTAC C - 28 30 -AAT AAATTTGG ATTN 
PCR 4 -GTTA TA TCCCA TAAA TCCCATGTTAA- PCR4 PCR5 - TTA TTTAAACCAAAA 
027-CCAAAGTATCAGGCTTGCAATATAGGGTTTTTAGGGTCCGTTTACCAGATCCTAATAAATTTGGATTT-027 



6 6035 CCTGAcTCgTCtCTtTTcGAtCCCACaACACAaCGTTTAGTATGGGCaTGCACAGGccTaGAGGTgGG 

HIM II II M II II lllll lllll llllllllllllll llllllll I lllll II 
11 6017 CCTGAtTCaTCCCTgTTTGACCCCACTACACAgCGTTTAGTATGGGCgTGCACAGGgtTgGAGGTAGG 

« cod I' IM I IIM I I MM Illlll II 1 1 1 1 1 f 1 1 1 1 1 II III I II IIM 

33 5852 CCTGACACCTCCTTTTATAACCCtGATACACAaCGaTTAGTATGGGCaTGTGTAGGccTTGAaaTAGG 

,c * QQ . Ji'Jii' IIIM "HUM- II llllllll II I M lllll 1 1 1 f 1 1 1 1 IIM IIM 

16 5892 CCTGACACCTCaTTTTATAATCCaGATACACAgCGgcTgGTTTGGGCCTGTGTAGGTgTTGAGGTAGG 

ii «.n llllLll ' i„ lllllllllll II II M II I llllllllllllll Ml I llllllll 

31 5811 CCTG ATACaTCTTTTTATAATCCTGAAAC tCAACGcTTAGTTTGGGCCTGTGTTGGTtTaGAGGT AGG 

i. ceo JULlili 1 ! i i ii iiii i iiii ii ii inn inn ii ii 1 1 iiim 1 1 in i ii 

18 5868 CCTGATACtagTaTTTATAATCCTGAAACaCAACGtTTAGTgTGGGCCTGTGcTGGagTgGAaaTtGG 

:on CCTGA-aC-tcttTtTataAtCCtga-ACaCAaCGttTaGTaTGGGCcTGtg-aGG— T-GAggTaGG 

CCTGACACCTCNNTTTATAAT-JJ30 
GGACTCrGCA-PCRS 

027-CCTGACACCTCCTTTTATAACCCTGATACACAACGATTAGTATGGGCATGTGTAGGCCTTGAAATAGG-027 
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6 6103 CAGGGGaCAgCCaTTAGGtCTgGGTGTaAGTGCaCATCCtTTcCTAAAtAAATATGATGATGTtCAAA 

nun ii ii inn ii urn inn urn n urn mnimiim mi 

11 6085 CAGGGGtCAaCCtTTAGGCGTTGGTGTtAGTGGgCATCCaTTgCTAAACAAATATCATGATGTaGAAA 

ii ii ii ii milium i urn urn n imiim mm mi 

33 5920 TAG aGG gC AG CC ATT AGGCGTTG G CATaAGTG G tC ATC CTTT ATT AAACAAATTTGATG AC AC t GAAA 

i i ii minimi ii urn inn iiimiimm inn imiiii mi 

16 5960 TCGtGGtCAGCCATTAGGTGTgGGCATTAGTCCcCATCCTTTATTAAATAAATTgCATGACACaGAAA 

m ii imiiimim n mum urn iiimiimm mimi mi 

31 5879 TCGcGGgCAGCCATTAGGTGTaGGtATTAGTGGtCATCCATTATTAAATAAATTtGATGACACTGAAA 

ii ii mil imim ii mini mum i iiiimi miumiui 

18 5936 CCGtGGtCAGCCtTTACCTGTtGGCcTTAGTGGgCATCCATTtTatAATAAATTaGATGACACTGAAA 

con — G-GGtCAgCCaTTAGGtGTtGG-aTtAGTGG-CATCC-TTattaAAtAAATttGATGAcactGAAA 

02 7 -TAGAGGGCAGCCATTAGGCGTTGGC ATAAGTGGTCATCCTTTATTAAAC AAATTTGATGACACTG AAA-02 7 



6 6171 AT tcaGGGagTGGTGGTAAcCCTGGaCAGGATAAcAGGGTTAATGTAGGTATCGATTATAAACAA 

ii m iiuiiui mu mum ummuuiummummi 

11 6153 ATaGTggtGGGTATGGTGGTAAtCCTGG.-'AGGATAATAGGGTTAATGTAGGTATGGATTATAAACAA 

ii mi iii i ii m 1 1 1 1 1 1 1 1 1 1 i u mumumii 

33 5988 ccGGTAacaaGTATcCTGGAcAaCCgGGTGctGATAATAGGGAATGTtTATCc ATGGATTATAAACAA 

i ii iii i i i i i mi mum mm mi mum nun 

16 6028 AtGCTAgTgctTATGCaGcAaATgCaGGTGtgGATAATAGaGAATGTATATCtATGGATTAcAAACAA 

i in i mu i i i ii mum milium mum mm 

31 5947 AcTCTAATagaTATGCcGgTggTcCTGGcactGATAATAGGGAATGTATATCaATGGATTATAAACAA 

ii ii i i i i ii mm i i ii i mum u 

18 6004 gtTCccATgccgccaCgtcTaaTgtTtctgagGAcgtTAGGGAcaaTgTgTCtgTaGATTATAAgCAg 

con at-ctaatgggtatgctggtaatcctggtgagGAtaatAGgGaa — TgTatctaTgGATTAtAAaCAa 

O 2 7 - CCG G T AAC AAGTATC CTG G ACAACCGGGTG CTG ATAATAG GG AATGTTT ATC CATGG ATTATAAACAA—O 2 7 



6 6236 ACaCAatTATGcATGGTtGGaTCTGCcCCcCCtTTgGGcGAgCATTGGGGTAAaGGTAaACAgTCTaC 

u ii mi urn u mu ii ii ii u u milium mi m m i 

11 6221 ACcCAGcTATGTATGGTgGGcTGTGCtCCaCCgTTAGGtGAACATTGGGGTAAgGGTAcACAaTGTtC 

u iii mu i i u m u ii in mmumm m in i 

33 6056 ACACAGTTATGTTTAcTTGGaTGTAAgCCtCCaAcAGGGGAACATTGGGGTAAAGGTgttgCtTGTAC 

mu u mm mi u ii ii ii i iiuiiui mu mu i mu 

16 6096 ACACAATTGTGTTTAaTTGGTTGCAAACCACCTATAGGGGAACAcTGGGGcAAAGGatCCCCaTGTAC 

mm 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n u n u urn mu u mi 

31 6015 ACACAAcTGTGTTTAcTTGGTTGCAAACCACCTATTGGaG AgC AtTGGGGTAAAGG t AgTCCTTGTAg 

mu i in i i ii u u mini ii n mi mm i i mm 

18 6072 ACACAgtTaTGTaTttTgGGcTGtgccCCtgCTATTGGgGAaCAcTGGGcTAAAGGcAcTgCTTGTAa 

con ACaCA-tTaTGt-Ta-TtGG-TGt CCacCtataGCgGAaCAtTGGGgtAAaGGtactcctTGTac 

O27-ACACAGTTATGTTTACTTGGATGTAAGCCTCCAACAGGGGAACATTGGGGTAAAGGTGTTGCTTGTAC-027 
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6 6304 tAATACacCTGTAC^ggcTCGTCACTCCCCgCCcra 

mil mini iniiiiiiiii n n iinnnnnnninn I 
II I I II lit II II II II llllllll I i i mi Willi I 

33 6124 TAaTGCAGcACcTgCcaaTGATTGTCCACCtTTAGAACTTATAAAtACTaCTACTgAG(^TGGTG 

i inn i i i illinium nut I wn n wn www 

16 6164 eAATgtTGCAGTAAaTCCaGGTGATTGTCCACCATTAGAgTTAATAAAcACAGCTATTCAGGATGGTG 

ii in i i ii wiwiw iiiiiiii iw w iww ii wn i 

31 6083 tAAcaaTGCTartACcCCtGGTGATTGTCCtCCATTAGAAITAAAAAAtt^GCTATaCAAGATGGgG 

i ii i ii ii wn ii n ww i wn wn i mini i 

18 6140 atcgcgTcCTtTatCaCagGGcGATTGcCCcCCtrTAGAAcTtAAAAAcaCAGTTtTggAAGATGGtG 

con _aata-tgCtgtae--cctggtGAtTC-CC-CC-TTaGAacTtAtaAaeac-gTTaTacAgGATGGtG 

JJ36-GATGGTG 

02 7 - TAATGCAGCACCTGCCAATGATTGTCCACCTTTAGAACTTATAAATACTATTATTG AGGATGGTG-02 7 



6 6372 AtATGGTTGAcACAGGCTTTGGTGCTATGAATTTTGCtGAtTTgCAgACCAATAAATCaGATGTTCCt 

I IIHIIII || 1 1 1 1 1 1 II III III 1 1 III 1 1 III 11 II N III 1 1 w w iwwi 

11 63S7 AcATGG^GAtACAGGCTCTGGTGCTATGAAT 

i inn ii inn ww w wn i n n i ww mum 

33 6189 ATATGGTgGAcACAGGaTCTGGTtgcATC^ 

mim i ii ii inni urn mi wn iiiiiiii iwwi inn 

16 6232 ATATGGTTcATACtGGCTTTGGTGCTATGGAcTTTACTACATTACAGGCTAAcAAAAGTCAaGWCCa 

iiiiiiii mi mum iiiiiiii win i i n ww i wn 

31 6151 AraGC^TACaGGCTCTGCaG^ 

iww wn ii i iii ii wn iw i i H wn ww 11 U 11 

18 6208 ATATCGTaGATACtGGaTaTGGtGCcATGGAcTTTAgTaCaTTgCAAGAtACIAAAtGTgAgGTaCCa 

con AtATGGTtgAtACaGGcTtTGGtgctATGgAtTTTact-caTT-CA-gc-AatAAA-gtgAtGTtCCt 
ATATGGTTGATACAGGCTTTGGTGCTATGGA-3 6 37-CATTNCANGCNAATAAANCTGATGTTCCT 
027-ATATGGTGGACACAGGATTTGGTTGCATGGATTTTAAAACATTGCAGGCTAATAAAAGTGATGTTCCT-027 



6 6440 aTTGAcATaTGTGGcACTacaTGtAAATATCCaGATTATTTaCAAATGGCTGCAGACCCaTATGGTGA 

mi n mil in ii iiiiiiii wnw 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 m iwwi 

11 6425 cTTGATATTTGTGGaACTgtcTGCAAATATCCtGATTATTTgCAAATGGCTGCAGACCCTTATGGTGA 

imiiimin i i itiiniiiii iiiiiiii inn n i> niiiiinii 

33 6257 aTTGATATTTGTCGcAgTAcaTGCAAATATCCAGATTATTTa AAAATGaCTagtGAgCCTTATGGTGA 

i iiniiiii ii iiiiiimmimii i iiiiii „ ii ii mil II 

16 6300 cTGGATATTTGTAcaTCTATTTGCAAATATCCAGATTATaTTAAAATGGTgtCaGAaCCATATGGCGA 

mi 1 1 1 1 1 1 1 iimm iiiimmiiiii 1 1 1 1 1 1 1 1 1 1 mi JLJLIillUJULI 

31 6219 TTGGAcATTTGTAAtTCTATTTGTAAATATCCAGATTATcTTAAAATGGTTGCtGAgCCATATGGCGA 

mil nun i miimimimi mm i nm nmi 11 imiii 

18 6276 TTGGAtATTTGTcAgTCTATTTGTAAATATCCtGATTATtTacAAATGtcTGCaGAtCCtTATGGgGA 

con -T-GAtATtTGTgg— cTattTG-AAATATCCaGATTATtTa-AAATGgctgcaGA-CC-TATGGtGA 

NTNGATATTTGT-JJ37 J J 3 8 - AAAT AT C CAG ATT AT TT AN AAATG G - J J 3 8 
0 2 7 - ATTG ATATTTGTGGCAGTACATGC AAATATCCAG ATTATTTAAAAATGACTAGTG AGC CTTATGGTGA-0 2 7 
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6 6508 TACaTTal TT 11 TT tTCTaCGgAAGGAACAAATGTTTGCcAGACAtTTTTTTAAcAGGCCtCGcgagG 

in ii mini i i it immimiiiiii lllll llllllll Mill II I 

11 6493 TAGgTTg TTT T TTT aTtTgCGAAAGGAACAAATCTTTGCtAGACAC TTT 1 1 TAATAGGGCcCGTACtG 

in ii inn i i i ill i ii 1 1 m 1 1 h ii iimmmmmm inn 

33 6325 TAGtTTATTTITc^tTcTtCGLAcGtGAACAAATGTTTGTaAGACA L ' lI ' X ITTAATAGGGCTGGTACat 

ii mum i i i iii i imiimum inn n immimiiiiii 

16 6368 cAG CTTATTTTTTTATTT AC GAAGGG AACAAATG TTT CTtAG ACATTT a TTT AATAG GGCTGGT AC tG 

i imiiimiimii mimimmiii n inn mum i n 11 i 

31 6287 TACa TTATTTTTTT ATTT ACG tAG GG AACAAATG TTTGT aAG G CATTTTTTTAATAG A t C AC Gc ACg G 

i i i mini inn inn i mi immm mm mi n 

18 6344 Tt Cc aTgTTTTTTTgcTTAC G gcG tGAgCAgcTtTTTGctAGGCATTTTTg g AATAG AgCAGG tACt a 

con tag-tTaTrrTTtTattTaCGaaggGAaCAaaTgTTTG-tAGaCAtTTtTttAAtAGggCtGGtactg 

WO 86/05816-CAGG 

. 027 -TAGTTTATTTTTCTTTCTTCG ACGTGAAC AAATGTTTGTAAGACACTTTTTTAATAGGG CTGGTACAT-0 2 7 



6 6576 TGGGGGAACCTGTGCCTGATacaCTtaTaaTtAAgGGtaGTggaAAtcGcaCgTCTGTAGggAGTAGT 

1 1 M 1 1 1 1 1 1 1 1 M f I M 1 1 II I I 11 II II II I I IIIIIII llllll 

11 6561 TGGGGGAACCTGTGC^GATGACCTGTTggTaAAAGGggGTaatAAcAGatCaTCTGTAGctAGTAGT 

i ii ii mi ii 1 1 1 1 1 1 1 1 1 1 i iiiii iiiiiii n m 

33 6393 TaGGaGAggCTGTtCCcGATGACCTGTACATTAAAGC tTCaGGaACTACTGCcTCTaTtcaaAGc ACT 

i n ii m n ii n i milium n n mini i 1 n 

16 6436 TTGGTGAAaaTGTaCCaGAeGAtTTATACATTAAAGCCTCtCCgTCTACTGCa AaTTTAGCc AGttca 

iiiiiiii ii n n iiiii minimi n n n n I mm i 

31 6355 TTGGTGAAtCgGTcCCTactGACTTATATATTAAAGGCTCcGGTTCaACaGCTACTTTAGCtAaCaGT 

i nm i ii iii minimum! mm i iii i iii ii 

18 6412 TgGGTGAcaCtGTgCCTcaatcCTTATATATTAAAGGCaCaGGTatgcCtGCTtCacctGgcAgCtGT 

con TgGGtGAa-ctGTgCCtgatgac-Tata-aTtAAaCGctctggtactactgC-tct-tagc-Ag-agt 

TGGGGGAACCTGTGCCTGATACACTTATAATTAAGGGTAGTGGAAATCGCACGTCTGTA-W08 6/05816 
LCR 2 A- ACC TGTTGGT AAAAGGGG GTAATAA- LCR2 A 
LCR2A* -GGACAACCATTTTCCCCCATTATT-I t CR2A ' 

LCR2B-CAGATCATCTCTACCTAGTAGT 
LCR23 * -GTCTAGTAGACATCGATCATCA 
LCR3A-ATTTATACATTAAAGCCTCTGCCTC-LCR3A 
LCR3A ' - AAA TATGTAA TTTCCGAGA CCCAG- LCR3 A ' 

LCR3B-TACTGCAAATTTAGCCAGTTCA 
LCR3B 1 - A TGA CG TTT AAA TCGGTCAAGT 
LCR 4 A- C CTT AT AT ATTAAAG GC ACAGG T AT - LC R 4 A 
L CR 4 A ' - GAA TA TA TAA TTTCCGTG TCCA TA- LCR 4 A ' 

LCR 4 B -G C CTGCTT C AC CTGG C AGCTG T 
LCR4B' -CGGACGAAGTGGACCGTCGACA 
027-TAGGAGAGGCTGTTCCCGATGACCTGTACATTAAAGGTTCAGGAACTACTGCCTCTATTCAAAGCAGT-027 
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6 6644 ATaTATGTtaAcACcCCgACcGGCTCtTTGGTGTCcTCtGAGGCaCAATTgTTTAATAAgCCATATTC 

ii iUiiJLLI J' JUL I' 1,111 11111111 11 1 1 1 1 1 in II I I ii M ii I II I I II I 

11 6629 ATTTATGTaCAtACaCCTAGTGGCTCATTGGTGTCTTCAGAGGCTCAATTATTTAATAAaCCATATTG 

5 ««, JLLLJL LLJULLiJUJUL 1 1 1,1111 11,1 11111 ■ 1 1 1 1 1 1 1 1 

33 6461 gcTTtTTTTCCcACtCCTAGTGGaTCAATGGTTACTTCcCAatCTCAqTTATTTAATAAgCCAIATTG 

J^UiiJULU.UiUiU "i 1,1,1,11 11 mi inn ii inn 

16 6504 AaTTArmcCTACACCTAGTCGtTCtATGGTTACcTCtGATCCcCAAATATTcAATAAACCtTATTG 

„ i H hiim 1 1 1 1 1 1 1 1 ii ii ii i ii in ii iiiii mil ii im mi nut 

31 6423 AcaTAcTTTCCTACACCTAGcGCCTCcATGGTTACtTCaGATCCaCAAATtTTTAATAAACCATATTG 
70 1Q CAQn fc IM "I I H H Hill II UNI II II I II I lllllllllllllllll 

18 6480 gtgTAtTcTCCctCtCCaAGtCGCTCtATtGTTACcTCtGActCcCAgtTgTTTAATAAACCATATTG 
con attTattttcc-aCaCCtAGtGGcTCtaTgGTtaC-TCtGA-gC-CAatTaTTtAATAAaCCaTATTG 
AT-LCR2B J J 3 9 -G TT AC N T CTG AN G CNCAATT ATTT AAT AAAC CAT ATTG 

TAA-LCR2B ' 
AA-LCR3B 
TTA-LCR3B ' 
75 GT-LCR4B 

CAC-LCR4B ' 

02 7 'GCTTTTTTTCCCACTCCTAGTGGATCAATGGTTACTTCCG AATCTCACTTATTTAATAAGCC ATATTG-02 7 
6 6712 GCTaCAAAAaGCcCAGGGACATAACAATGGTATTTGtTGGGGtAAtCAacTGTTTGTTACTGTGGTAG 

JULLiiii 1 i 1 mmmii n inn n n nmmiiimim 

20 11 6697 GCTtCAAAAgGCtCAGGGACATAACAATGGTATTTGcTGGGGaAAcCActTGTTTGTTACTGTGGTAG 

UIJIL 11 11 11 1,111 miimm mil n n i mnnmnnn 

33 6529 GCIACAACGtGCACAaGGtCATAATAATGGTATTTGTTGGGGcAAtCAggTATTTGTTACTGTGGTAG 

, fi „„ i mini inn n n mmii ininimi n n minimni n i 

16 6572 GtTACAACGaGCACAGGGcCACAATAATGGcATTTGTTGGGGtAAcCAacTATTTGTTACTGTtGTtG 

„ £aa , i i mil ii inn iiinnmi iiimiim n n minnimi n i 

25 31 6491 GaTgCAACGtGCtCAGGGaCACAATAATGGTATTTGTTGGGGcAATCAgTTATTTGTTACTGTGGTAG 

i« «... Li li 11 11111 11 11 11,1,1 1 1 inn miiiiiiiiiinnii 

18 6548 GtTaCAtaagGCaCAGGG t C A tAAcAATGGTg TTTG c TGG ca t AATCAaTT ATTTG TT ACTG TGGTAG 

con GcTaCAa GCaCAgGGaCAtAA-AATGGtaTTTGtTGGggtAAtCAatTaTTTGTTACTGTgGTaG 

GCTACAANNNGCACA- J J 3 9 J 4 1-AATGGTATTTGTTGGGGTAATCAATTATTTGTTACTG TGGTAG 
C 6 -GCMCAGGGWC ATAAYAATGG-C 6 CI -CTGTGGTAG 

C7 - CTGTTGTTG 
C 8 -CTGTGGTAG 
CIO- CAGT TGTAG 
C 1 1 -CTGTGGTTG 
C12-CTGTTGTGG 
C13-CTGTTGTAG 

35 CI 4- CTGTGGTAG 

C 1 5 -CTGTAGTCG 

027-GCTACAACGTGCACAAGGTCATAATAATGGTATTTGTTGGGGCAATCAGGTATTTGTTACTGTGGTAG-027 
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€ 6780 ATAC C ACAC C CAG T AC c AAcATG AC A t TATG TGCATC cGTaaCTAc AT CT tC c ACATAC AC c 

ii UiliiUiiiiUU i' 111111 1111 111111 " i" 1111 1 'iiiiiii 

11 6765 ATACCACACCCACTACaAATATGACAcTATG TG CATC tGTot CTAaAT CT gCTAC ATACAC t 

„ „ M "inn niiiiii iniiiii mi ii ii iii iiiiiii 

33 6597 AT AC C AC t CGC AGTAC tAAT ATG AC tTTATG C a CAc a AGTaACT Ag t G Ac AgTACATAT AAA 

1C CCA „ '"I ii I I M 1 I I I IIIHI I lllll | | HI || | IIIIMIIII 

16 664 0 ATAC t ACAC GC AGT AC aAAT ATGTC aTTATGTG CT GCcATAtCtACTtcaGAaACTACATATAAA 

„ mi urn inn niiiiii i nun n u i i n mini mi 

31 6559 ATACC ACACG t AG TACCAAT ATGTC tgTtTGTG CT CCAATtgCaAacagtGATACTACATtTAAA 

la C£ , e iiiiiii i 1 1 1 1 1 1 1 1 1 i i i mm ii i i ii ii 

18 6616 ATACCACtCccAGTACCAATtTaaCaaTaTGTGCTtctaCAcagtCtcctgtaccTgggcaATaTgAt 

con ATACcACaCgcACTACcAAtaTgaCatTaTGtgct— tgCa g-aacta-ag-tactacATataaa 

ATACC- J J 4 1 C 1 6-CATCCGTAACTACATCTTCCA-C 1 6 

ATACC ACACGCAGTAC -CI C 1 7 -TCTGTGTCTAAATCTGCTACA-C 1 7 

ATACTACACGCAGTAC-C7 C 2 0 - CACACAAGTAACT AGTG AC AG-C 2 0 

ATACCACTCGCAGTAC-C8 C23-CAGTCTCCTGTACCTGGG-C23 

ATACTACTCGCAGCAC -CIO C3 1 -TTGCAAACAGTCATACTACATT-C3 1 
^TAC TACC CGTAGTAC - C 1 1 
ATACTACCAG AAGCAC -C 1 2 
ATAC TACT AG AAGCAC -C 1 3 
ATAC CACACGTAGTAC - C 1 4 
ACACTACCCGCAGTAC-C 1 5 

02 7 - ATAC CACTCG CAGT ACT AAT ATG ACTTTATG CACACAAGTAACTAGTGACAGTACATATAAA-02 7 

6 6842 AATTCtGATTATAAaGAgTACATGCGtCATGTGGAaGAGTaTGATTTACAaTTTATTTTTCAATTaTG 

ii iiiiiJLLUiii ihi 1 11,11111 1111 iiiiiiiii imiiiiiiiiii ii 

11 6827 AATTCAGATTATAAgGAATACATGCGcCATGTGGAgGAGTtTGATTTACAGTTTATTTTTCAATTgTG 

« iii UiLii 1 i ,m 11 1 1 11 1 1111 i 1 1 1 mum i ii 

33 6659 AAT g aAAATTTT AAa G AAT A t AT A aG ACATG T t G Aa G AAT ATG ATc T AC AGTTT gTTTTTC AAC T a TG 

i 6 fi , fl , iiL ,ii iiiii IUJ niiiiii n iiiiiiiii mum minim ii 

16 6705 AATAcTAAcTTTAAgGAGTACCTAcGACATGGgGAGGAATATGATTTACAGTTTATTTTTCAACTcTG 

fifi „ i n in inn iiiii n iimn m niiiiii iiiii inn i n 

31 6624 AgTAgTAAtTTTAAaGAGTAT tTAAGACATGGTGACGAATtTGATTTACAaTTTATaTTTCAGTTaTG 

i 8 «<m ii ii iiiii ii ,M 11,1111 iiiiiii) mm ii iiiii iiiimi n 

18 6684 gcTAccAAaTTTAAgcAGTATagcAGACATG tTGAGGAATaTGATTTgCAgTTTATtTTTCAGTTgTG 

con aaTactaAtTtTAA~gAgTA-ata-GaCATGt-GAgGAaTaTGATtTaCAgTTTaTtTTTCAatT-TG 

02 7 -AATGAAAATTTTAAAGAATATATAAGACATGTTG AAGAATATGATCTAC AGTTTGTTTTTCAACTATG-02 7 

6 6910 TAGCATTACATTgTCTGCtGAAGTaATGGCCTATATtCACACAATGAATCCcTCTGTTTTCGAaGACT 

,i iLUiiiiiii 1 1,111 1 mmmi immimiii mmim 

11 6895 TAGCATTAC ATTATCTGC AG AAGTc ATGGCCTAT AT aCAC AC AATG AATCC tTCTGTTTTGG AqGACT 

,L 1,11 1,1 1,1 1 1 1,1,1 11 i mum i mi mi i 

33 672 7 CAAAgTTACCTTAACTGCAGAAGTTATGACATATATtCATgCTATGAATCCagaTATTTTaGAaGAtT 

» iiiLLiiii 1111111111 11,11 1 " 1,1 iiiimi i mm n n i 

16 6773 CAAAATAACCTTAACTGCAGACGTTATGACATAcATaCATtCTATGAAT tCcaCTATTTTGGAqGAcT 

,1 <„, liiiUii' 111 1111,111 1 11,11111 11 11 iiiiiii i 1 1 1 1 1 1 1 r 1 1 ii i 

31 66 9 2 CAAAATAACaTTAtCTGCAGACaTaATGACATATATTCAcAGTATGAATcCtgCTATTTTGGAaGATT 

lft „„ i H H in iiiiiii i in i imim iiiiiiiii mm n im 

18 6 752 tAct ATtAC tTTAaCTGCAGAtgTtATGtCcTATATTCAt AGTATGAATagcagTATTTTaGAgGATT 

con -AaaaTtACaTTa-CTGCaGAagTtATGaC-TAtATtCA-actATGAATccc-cTaTTTTgGA-GA-T 
027- CAAAGTTAC C TT AAC TG CAG - 0 2 7 
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6 6978 GGAACTTTGGgTTATCGCCTCCcCCAAATGGTACAtTaGAaGATACcTATAGgTATGTgCACTCACAG 

„ <o R , ^iiUU]iJiiUi l,IM nil ii i ii iii i ii iiiii inn iii ii mi i ii 1 1 

11 6963 CGAACTTTGGTTTATCGCCTCCaCCAAATGGTACAcTgGAGGATACtTATAGaTATGTaCAGTCACAG 

,n «,o, !i i ' ||j 'U 1 1 I'm HI 'I I' I IINIM UNI I HI M Ml 

33 6795 GGcAaTTTGGTTT^aCACCTCCtCCAtcTGcTAgttTAcAGGATACcTATAGGTTTGTtaccTCtCAG 

lfi fifld1 1 1 i mi ii i ii Mini in iiiii inn 1 1 1 1 1 1 1 1 1 1 1 i in 

16 684 X GGAATTTTGGTcTAcaACCTCCcCCAggAGGcACacTAGAaGATACtTATAGGTTTGT aaCCCAG 

„ ««« ''''iU' 1 ' ' ' ' ' M III I I II IIIII lllllllllll I III 

31 6760 GGAATTTTGGatTgaCCaCaCCtCCctCAGGTtCTTTCGAGGATACcTATAGCTTTGTcaccTCaCAG 

,„ fifl , n MM MIM I M I II II I I IIIII llllll III I IIIII II 

18 6820 GGAAcTTTGGtgTtcCCcCcCCcCCaaCtacTagTTTGGtGGATACaTATcGtTTTGTacaaTCtgtt 
GGaAcTTTGGttTa-c-cCtCCcCCaactggtac-tT-gagGATACcTATaGgTtTGTaca-tCacag 

C21-TTGT AACCCAG 
C34-TTTGT AACCCAG 
C35-GTTTGT AACCCAG 



con 



11 



33 



7046 GCCATTACCTGTCAaAAgCCCACtCCTGAAAAgGAAAAgCcaGA 

1 1 1 1 1 1 1 1 1 M 1 1 1 II II M I llllllll IIIII I II 

7031 GCC ATTACCTGTCAg AAACCCAC ACCTGAAAAaGAAAAaCAgGA 

Mn II Mill IIIII Ml I I II I III Mill I II 

6863 GC tATTAC gTGTCAAAAAaCag tACCTCCAAAgGAAAAgG AAGA 

Ml I I i 1 1 1 1 1 1 1 Illllll II IIIII 



TCCTATAAGaAccTtAGTT 

IMMIMII I I MM 

TCCCTATAAGgAtaTgAGTT 

MM I I I 

CCCCTTAggtAAATAtACaT 
M I 



16 6906 GCaATTgCtTGTCAAAAAcaTaCACCTCCAgcaCCtAAaGAAGATgatCCCCTTA AAAAATAc AC t T 



31 6828 



18 6888 



Jl I MI1IMII I I II I r M II MUM 

GCcATTACaTGTCAAAAAacTGCcCCcCaAaagCCcAAgGAAGAT 

M Mill llllllll III II II I III 

GCtATTACcTGTCAAAAggaTGCtgCaCcggctgaaAAtaAgGAT 

GCcATTaCcTGTCAaAAacct-cacCtc-aaaggaaAAggaaGAt- 
GCAATTGCT-C21 C1B-CATACACCTCCAGCACCTAA-C18 
C 1 9 -GGATGCTGCACCGGCTGA-C 1 9 
C22-AAAAACAGTACCTCCAAAGGA-C22 
C27 - TTTTTGTCA TGGAGGTTTCCT-C 2 7 

C2 4 -CACACCTGAAAAAGAAAAAC AG-C 2 4 
C 2 8 -GTGTGGACTTTTTCTTTTTGTC-C2 8 
C25-CTCCTGAAAAGGAAAAGCCA-C25 
C 2 6 - GAGGACTTTTCZTTTTCGGT-C 2 6 
C2 9 -CCAAAAGCCCAAGGAAGAT 
C30-CAAAAGCCCAAGGAAGAT 
C3 2 -CAG AAACCCACACCTGAAAAAG A-C3 2 
C3 3-AGAAACCCACACCTGAAAAAGAA-C3 3 
GCAATTGCT-C34 02 3 -GGA 

GCAATTGCT-C35 015-GGAT 



.11 II I II 
CCaTTTA AAgAtTAtgtaT 

Illllll I 
CCcTaT gAtaAgTtaaagT 



— cCccttaaa-aA« 



•t-ta-tT 
JJ4 6-T 



C-C29 
C-C30 



TCCCTATAAGGATATGAGTT 
CCCTAT GATAAGTTAAAGT 
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6 


7X10 


11 


7095 


33 


6927 


16 


6973 


31 


6892 


18 


6952 


con 





illinium mmmmimi mum mini m i him mm 

TTCGGAGGTTAAcTTAAAACAAAACTTTTCAAGTGAATTAGATCAGTTTCCccTtGCACGtAACTTT 

mill n i inn mil mm u imiiimim i urn mm 

TTGGGAAGTggATTTAAAGGAAAAaTTTTCAGCAGAtTTAGATCAGTTTCCTTTgGGACGCAAGTTT 

iiiiiiiii imimmii inn inn immmmm nmm in 

TTG GG AAG TaAATTT AAAG GAAAAGTTTTCTGC AG Ac cTAG ATC AGTTTC CTTTaGG AC CC AAATTT 

mm ii immi mmiimimii imimimi i n iiiiiiiii 

TTCGGAgGTtAATTTAAAaGAAAAGTTTTCTGCAGAtTTAGATCAGTTTCCaCTgGGtCGCAAATTT 

mi i n mini imiinmi m nmm i m n n n mm 

ATTGGaAtGTggATTTAAAgGAAAAGTTTTCTttAGAcTTAGATCAATaTCCcCTtGGaCGtAAATTT 

TTTGGgAgGTtaAtTTAAA-GAAAAgTTTTCtgcaGA-tTaGATCAgTtTCCt-TgGGaCGcAA-TTT 
TTTGGGAGGTTAATTTAAANGAAAAGTTTTCTGCAGANTTAGATCA-JJ46 

C2-GATCAGTTTCCYYTKGGACG-C2 
C3-GATCAGTWTCCYYTKGGACG-C3 
C1-CTAGTCAWAGGIUUIHCCTGC-C7 
015 -TTTCGAATGTGG ATTTAAAGGAAAAGTTTTCrTTAG ACTTAG ATC AATATCCCCTTGGACGTAAATTT-0 1 5 
02 3-TTTGGGAGGTTAACTTAAAAGAAAACTTTTCAAGTGAATTAG ATCAGTTTCCCCTTGGACGTAAGTTT-02 3 

6 7178 TT gTT aCAAAGTGG ATATaGgGG ACGG tCcT 

n ii ii miiimm i mm i i 

11 7163 TTA TT gCAAAGTGGATATcGACGACCGaCgT 

III II III 

33 6995 TTA TTAcagGcAggtcttaaagcAaaAcctaaacttaaACGtgcAGcccccaCAtCcaCCcgCA 

1C „ in ill ii mi I i ii ii 

16 7041 TTA cTACAAGCAGGATtgAaGGCcaaACCAAAATTTAcAttAGGaAAACGaaaagCTaCACCCacCA 

m mi mini i m n mini i in nm i inn 

31 6960 TTA tTACAGGCAGGATatAGGGCacgtCCtAAATTTAaAgCAGGtAAACG TagTcCACCC t 

ii i nm mi i ii i i ii nm i 

18 7020 TT ggTtCAGGCtGGATtgcGtcgcaagCCcAccaTaggccCtcGcAAACG T tctg 

con TTa — ta — agcaggattgagggcaaaaccaaaaataa-a-cacgaaaa-gatatag-gcaccc-cct 

015-TT G GTTC AG G C TGG ATTG C G TC GC AAG C CC ACC AT AGG CCCT C G C AAACG T TCTG-015 
02 3 -TTA TT GCAAAGTGGATATCGAGGACGGACGT-023 

6 7209 CTatTCGTACAGGTgTtAAGCGCCCtGCTGTtTCcAAagCCTCTgCtGCCCCtAAACGtAAgCGccCC 

„ „ ii miiimi i nmm nm n n nm i nm nm n n n 

11 7194 CTgCTCGTACAGGTaTaAAGCGCCCaGCTGTgTCtAAgcCCTCTaCAGCCCCcAAACGaAAaCGTaCC 

I I I II II || | | | 

33 7059 CaTCgTCTgCAaaacgcAAaaaggttaaaaAATAAcAcTttGtgtaAttgtgtTAtgtTGTtgtTttg 

w ' " " ' ' ' II II I I I I III 

16 7108 CcTCATCTACCtCTACAACtGCtAAACG cAAAAAACgTAAGctGTAA GtaTTGTATGTA 

„ , M1 i mum nm i ii nm .nun m mi 1 iniim 

31 7021 CagCATCTACCACTACAcCaGCaAAACGtaAAAAAAC TAAAaaGTAAtgGatgTGTATGTAAtaCaT 

i nm mini i mi him 

18 7075 CtcCATCTgCCACTAC gtcttC TAAA CcTGccAagCgT 

con Ct-catcTaC-actacaaa c— a— t-aat-aa-gtaa-ctg-a-cc-ct-a-c-tgta— tcc- 

015-CTCCATCTGCCACTAC GTCTTC "TAAA CCTGCCAAGCGT-015 

023-CTGCTCGTACAGGTATAAAGCGCCCAGCTGTGTCTAAGCCCTCTACAGCCCCCAAACGAAAACGTACC-023 
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6 7277 AAAACtAAAAgGTAATATATGTGT aTaTCTacTGTT 

inn mi iniiiiimii i m mi 

XI 7262 AAAACcAAAAaGTAATATATGTCTgtcagTGTGTTGTGTT 

I I ill milium 

33 7127 TtcTGtcTAtGTactTtgtgTTCT TGTGTTGTGTTgtTGT 

c , M i ii i ii mi inn in 

16 7167 TgtTG a&T t a GTGT TGtTTGT ■ TGTGT ATATGT 

„ 1 1 1111 " 'in in nun 

31 7088 GTGTctgTatGTGTAtGTGCTTGTgctgtatTGT AT ATG TGTG T GT t t q t g t g t TATATA tg 

in mil mi ~ i iiimmni mm 

16 7113 GTG cGTGTAcGTGC . caGgaagtaATATGTGTGTGT gtatataTATATAcat 

con -t-tctataagtgtat-tgtttgtg tgtGtagtgt-tatc/tgtgtgt tatata 

015-GTG CGTGTACGTGC CAGGAAGTAATATGTGTGTGT GTATATATATATACAT -01 5 

023 -AAAACCAAAAAGTAATATATGTGTGTCAGTGTGTTGTGTT- ( 023 ) 

6 7313 AT 
II 

11 7302 ATtTaTATG T TGTTGTA gTGTGT 

i i mi i mini mn 

33 7167 TTGT TtTtTgTGTATG TGttacaaTgtATgTTATGTTGTATGTtacTGTGTTTG 

mi i i mm n i n in iiimm mini 

16 7199 TTGTATGTgctTGTATC TG CTTGTAaATATTAaGTTGTATGT GTGTTTG 

mini mm mm mi i mm m n 

31 7150 gTATaTG TATGTTTaTGTATG C gTGTGT aC TTGT AT AT AT GtaTaGTATGT TATGTgTG 

in i mn mn i mn mi i in i i mi i 

18 7164 cTAT t gt TgTGTTT GTATGtCcTGTGTttgtgtTTGT TgTAT G aTtgcattgTATG G 

con -tatttgtatgtttttgtatg-c-tgtgt-tgt-cttgtatatattotgttgtatgtt-gtgtgtttg 

015-CTATTGTTGTGTTT GTATGTCCTGTGTTTGTGTTTGT TGTAT G ATTGCATTGTATG G-015 
023- ATTTATATG T TGTTGTA GTGTGT (-023 ) 

6 7315 aTATATGT 
1 7325 aTATGT TtCTTGT atTGTG TATATGT 

mn i mn nn i mn 

3 7221 T ttTATGTgTaCTTGTttGTGTGcATGTTcTATGTacttgt 

i mn mi 

6 7248 TATGTATG gtaTAATAAA cacG TGTGT ATGT 

immi imm iiimm 

1 7209 TATGTATGctatgtaTGTTAATAAAtatgtgtatacctgtgtgtgtTGTGTATGTTGTcctTataTAc 

mimi mi iiiiiiiiii i ii 

8 7221 TATGTATG gtTGTT gtTGTATGTTGTatgTtacTAt 

con tatgtatg tgttaataaa ttatgt-ttcttgtt-gtgtgtatgtt-tatgta — tat 

0 1 5 -TATGTATG GTTGTT GTTGTATGTTGTATGTTACTAT-0 1 5 

023- ATATGT TTCTTGT ATTGTG TATATGT ( -023 ) 
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6 7323 GT gTATGTAcTCT 

II llllll III 
11 7351 GTataTGTTTGTGTATATGT GTAT gtTATGTA TGT 

i ii mi nun in mi mm in 

33 7262 c a g tTt c cTGTtXGTGT ATATGT t a ataaaacatt gTgTGTAT tT g tT a aAc tATt t gTATGT A TGT 

II I I I I I I I II II I 

16 7279 GTtTtTaAaTgcTTgtgtAACTATTGT gTcATgcAacATaAaTAaacttatT 

I II I II 1 I I I I I 1 1 I I I I 

31 7277 AccctaTtagtaacatacTaTtAcTAtTTtataAACTATTGTtCCTActTgtTcctAcTtgttCCTgc 

II .1 III I I I I I III 

18 7257 AtttgtTggtatgtggcaTtaaAtaAaaTatgttttgtggtTctgTgtgTtaTgtggtTgcgcCCTag 

con a tat-tgtttgtgtatat-ataatataagaaactatgttttttatgtaatatt — Tatgtactgt 

015 - ATTTGTTGGTATGTGGCATTAAATAAAATATGTTTTGTGGTTCTGTGTGTTATGTGGTTGCGCCCTAG-0 1 5 
023- GTATATGTTTGTGTATATGT GTAT GTTATGTA TGT-023 

6 7336 TATGT aTATGT GTg TGTGTGTTc t GTGTGT a a t g t aAg t-TATTTG TG tAATGTGTAT GTgTGTT 

mil mil ii ilium mm i mum milium mi 

11 7386 TATGTtgtTATGTatGTtTGTGTGTTtaGTGTGT gtATaTATTTGTGGAATGTGTATGTATGTT 

urn i n ii i mi ii ii n 

33 7329 TATGT AtatgggtgtaccTataTGaGTAagGagTTgTATTgcTtCccctacCcTGCATTgc 

I I I I I I I I I III I 

16 7331 gtTTCaacAcctACtaaTtgTgtTgTggtTaTtcAtTGTATaTaAactaTatTtGctACATcCtgTtt 

III I I I I I llllll MM 

31 7345 TccTCccaAtagtCATgTacTTaTtTctgccTatAaTTTAggTgTcacgccaTaGTaAaAgTtgtaca 

I I III II I II I II 

18 7325 TgagtaacAactgtATtTgtgTtTgTggtatgggtgTTgcttgtTgggctatataTtgtccTgtattt 

con tatgtaa-aa-gt-attttgt-tttT-tgtgtgtaatgtattttatttgt-taa-ttgtatgt-tttt 

015-TGAGTAACAACTGTATTTGTGTTTGTGGTATGGGTGTTGCTTGTTGGGCTATATATTGTCCTGTATTT-015 
02 3 -TATGTTGTTATGTATGTTTGTGTGTTTAGTGTGT GTATATATTTGTGGAATGTGTATGTATGTT -02 3 

6 74 00 TaTGTGCAATAAACAATTAcctcTtgtTacacCCTGT gACtCAGTGgctgttgcacgcGTTtTGgT 

i mmiiiiiiimi i i mil ii urn m 11 i 

11 7450 TtTGTGCAATAAACAATTA TTatgTgtgtCCTGTTACACcCAGTG actaaGTTgTGtT 

III I III I I I III I II 

33 7390 aaTGTaCcTAccTttATTtcccTaTAtTtgtAGtaCCTACATGTttaGTattgCtttacCtTTTGaca 

I II I I Ml II II III I I I III 

16 7399 ttgtTTtaTATaTactaTAtTtTgTAgcgcCAGgcCCatTTTGTaGCtTCaAcCgaAttCggTTGcat 

III M I III Hill I 

31 7413 CccGgTccgtTtTTtgcaACTaaAgctacTCCATTTTgaTTTtatGCagCCAtTTTAaaTcccTAACC 

ill i ii i mum i i m i mi 

18 739 3 CaaGtTataaaacTgcacACcttAcagcaTCCATTTTatccTacaatcctCcaTTTtgcTgtgcAACC 

con tatgttcaa-aatt-attaccttata-t-tcc-tt-t-acat-cagtg-c-atttta — cgttt-act 

015 - C AAGT T AT AAAACT G C AC AC CTT AC AGC ATC C ATTTT ATC C T ACAAT C CTCC ATTTTGC TG TG C AAC C - 0 1 5 
02 3 -TTTGTGCAATAAACAATTA TTATGTGTGTCCTGTTACACCCAGTG ACTAAGTTGTGTT-02 3 

024-GAATTCGGTTCCAT 
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25 



30 



6 7466 TTGCACCCGCCtTacacacataagTaATATacaTgcAcaATATATATaTtTttgtTtaaaATACTAT 

milium i i mm i i iiiiiiii i i i mini 

11 7508 TTCCACGCGCCgTtTgtgttgccTTCATAT TatAtTATATATATtTgTaataTacCTATACTATg 

I I II M III I I I II I I I II I 

33 7458 TacTAgTGtCCaTATtgtacaaTTTCcTccattTTgTATGcCTAaccgTtTtcggTtACTTgCCAtac > 

I II I I III II llll I II II III 

16 7467 GcTTTtTGgCaCAAaaTgTgttTTTttaAaTAgTTCTATGtCagcaacTaTGgTtTaAacTTGTAcCT 

I m i ii it i iii urn in hi ii 

31 748 1 CtTTTCGGTTGCAttgTtTaaacaTgctAgTAcaaCTATGctgatgcagtaGTTcTGcggTTtTTgGT 

i minim n ii i i i 

18 7461 GaTTTCGGTTGC CtttggcTTaTGtctgTggTttT 

con -ttt-cgg-ccctat-t-ta-a-ttc-tataa-t-ctatgt-tatat-ttt-tt-T-actttgct-tt 

0 1 5 -GATTTCGGTTGC CTTTGGCTTATGTCTGTGGTTTT-O 1 5 

023 -TTGCACGCGCCGTTTGTGTTGCCTTC ATAT T ATATTAT ATATATTTGT AATAT ACCTAT AC T ATG -O 2 3 

024 - GCTTTTTGGCAC AAAATG TGTTTTTTTAAATAGTTCTATGTCAGCAAC T ATG GTTTAAACTTGT AC GT -02 4 



6 7533 aC t tt t a t AtTTGCAACCGTTTTCCGTTGCCCTTAgC ATACACTTtCCaCc AATTTGTTAc AAC 

i i immimmiiiimmi iimim n i imimi m 

11 757 3 tTACCCcccccCAcTTGCAACCGT* fTCGGTTGCCCTTA C AT ACA CTT aC C TC aAATTTGTT A t AAC 

lllll II I III II III llll I III 

P0 33 7526 aTACCCtaTgaCAtTGCCAGaacAgTTaaTccTTTtCTttCCTGCACTCtgtTtgTcTgTACTtgctg 

i i i i ii i i i i iii ii iimim i ii 

16 7535 TTCCTG cTtgCcaTGcgtGccaAaTcccTgtTTTcCTgaCCTGCACTG cTTgccaACcaTtcc 

mm I II I 

31 7549 TTCCTG aaTAcTagTTTttGCcaacaTTCTggcTtgTagt 

II II I III II Ml I I 

18 7496 cTgCacaatacagtacgctggcactattgcaaacttTAaTctTTTggGCactgcTcCTacaTatTttg 



tt-c-ct-tt-catt-gcagcc — tttcg-tt-ctcttatc-T-cact— c-tcttct-tattata-c 
015-CTGCACAATACAGTACGCTGGCACTATTGCAAACTTTAATCTTTTGGGCACTGCTCCTACATATTTTG-O15 
023- TT AC CCCC C CCC AC TTG C AAC C GTTTTCG GTTG CC C TTA CATACACTTACCTCAAATTTGTTATAAC-02 3 
02 4 -TTCCTG CTTGCCATGCGTGCCAAATCCCTGTTTTCCTGACCTGCACTG CTTGCCAACCATTCC-024 



6 7597 GTGTTTccTctTAATCCtATATattTTGTG CcAGGTACAcATTCCCCTGCCAAGTtgCTTGCCAA 

mm i mm mi urn i mini mmmiimi iiiiiiii 

11 7640 GTGTTTtgTACTAATCCcATAT gTTGTGtgcCAAGGTACAtATTGCCCTGCCAAGTatCTTGCCAA 

I III I II II II II I lllll I 

33 7594 caTTggcaTACatAcCCtATgacatTgGCagaaCAgTtAATcctTTtCTTTcCTgcacTgtgTTtgtc 

35 , I I II I II I I I II I I I 

16 7598 aTTgTtttTtACACtgCacTatgtgcaACtActgAaTCAcTaTgTaCATTgtgTCataTAAaaTaaaT 

I I I llll I ~ I I I II II II I I II I 

31 7589 tTCcTgccTaACACacCTTgccaaCATATAAtccAgTCCaacTtTGCAATTAtaCtATgAAtCatgtT 

i ii iii mini i ii mi n i 

18 7 564 aaCaattggcgCgCctCTTtggcgCATATAA ggCgcaccTGgtATTA gtcATtttcCtgtcc 

40 con -t-ttt — ta-ca-tcctatat tt-taa-ccaa~g-aca — Ttgc-tt-c — aatt — ttt — a- 

015-AACAATTGGCGCGCCTCTTTGGCGCATATAA GGCGCACCTGGTATTA GTCATTTTCCTGTCC -01 5 

023 - GTGTTTTGTACTAATCCCATAT G- 02 3 

024 -ATTGTTTTTTACACTGCACTATGTGCAACTACTGAATCACTATGTACATTGTGTCATATAAAATAAAT-02 4 

45 
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6 7662 gtgcatcatatcctgccaaCcACACACCTGGCgcCAGGGtCCCGTATTCC cTtactcATAA 

n „ os i iniiiiiiii inn iniiiiiii i mi 

11 7706 — CAACACACCTCGC CAGGG cGC CCTATTGCaTg AcTa aTgTAc aATAA 

33 7662 tgtacTtgctgcAttgacTCAtatataCatGCAGtgcaATtgcaalaTaCTTaACT^ 
16 7666 cacTaTgcgc^CgcctTacatACcgCtgtTAGgcacATa^ 
31 7657 tGtTtaaaTACAACtgtagttcaACtATgtgTcatgcAcara^ 

,o I ' 111111 11 1 I "III llll I I III 

18 7626 aGgTgcgcTACAAC aATtgcTtgcatAacTATAT ccactcCCTA AgtaaTaAAA 

con tg-tatg-tacaacgccatc-a-acaactgg-agca-aatt-tata-t-cttt-cta-a~ actaaaa 

BE31-XXAGGCACAXAXXXX-BE31 hUXl&+*8+33 
015-AGGTGCGCTACAAC AATTGCTTGCATAACTATAT CCACTCCCTA AGT AATAAAA-0 1 5 

02 4 -CACTATGCGCCAACGCCTTACATACCGCTGTTAGGCACATATTTTTGGCTTGTTTTAACTAACCTAAT-02 4 

6 7723 ACCTGTC TTTGTgttAtAcTtTTaTGcAcTGtAGCCAActcTTAAAAGCATTTTTGGCTTcTAGCa 

ii iiiiilJL ILLL1 'J i "in i ii iiiiii m i mi i urn inn mi 

11 7753 ACCTG TCG g TTTGT ACAaTgTTgTGgATTGCAGCCAAAggTTAAAAGCATTTTTGGCTTcTAGCt 

i i m HI II I II I llll llll III MINIMI I 

33 7730 TaCAcATGcTTTtaggcACATAtTTTTactTTaCtttCAAAccTTAAgtGCAGTTTTGGCTT aCa 

INN II II IIIIII llll I I 

16 7734 TgCATATTtGGCAtAaggTTTAaacTTcTAaggCcAaCtAAatgTcAccctAGTTCaTaCaTcaActq 

I N llll I III II || | III 

31 7725 CTGCTTTTAGGCACATATTTT C-TagaTTATctaTAtCctTgATTGCAgtgcTGGCTTttgcacAtgt 

iNiiiiiiiiiiiiiiiNi iiiiii i iiiiii mm 

18 7680 CTGCTTTTAGGCACATATTTTaGTttgTTtTtacTtaagcTaATTGCAtactTGGCTT 

con — c-tttta atataat-tagtttt-tattgct — caaa — tTaaa-gcattt-t-gcttgtagc- 

BE31-XXAGGCACAXAXXXX-BE31 hpvl6+Z8+12 
BE3 1-XXACGCACAXAXXXX-BE3 1 hpvl 
O 1 5-CTGCTTTTAGGCACATATTTTAGTTTGTTTTTACTTAAGCTAATTGCATACTTGGCTT- ( 015 ) 
02 4 -TGCATATTTGGCATAAGGTTTAAACTTCTAAGGCCAACTAAATGTCACCCTAGTTCATACATGAACTG-02 4 

6 7789 GcACATTTTTtTgCtCTTAcTgTtTggTatACAATAaCataAAAATGAGTAACCTAAGGTCACACACC 

ii 7.1. 1 lUJJJi! 1 1 1111 1111 111,11 1 f 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 II 1 1 1 1 1 1 1 1 

11 7818 GAACATTTTTGTACcCTTAgTaTaTtaTgcACAATAcCcacAAAATGAGTAACCTAAGGTCACACACC 
N IIIIII II I || | || 

cAAttgcTTTGTAtgCcaAactaTgccTTGTAAAAgtgagtcActacctgttTaTtAccaGGTGTGga 

,« 11 11 11111111 1 nun 

16 7802 Tgt AAa gGTT Agt c aTac AT t gT TCATTTGTAAAA cTgcAcatqGGTGTGtcj 

I II I I III Nil III | 
31 7792 TtaAActGccAaggTTgtgTcaTgCATTaTaAATAagttgTatgttactcaTATAATtaATtgCatAt 

„„ N I I | | | NUN II I | 

18 773 8 gtacaactacTTtcaTgtccaAcatTctgTctacccTtaacatgaacTATAAT ATgaCtaAg 

con -aa-attttt-tact-ttatt-tt-a-tttaaaaaaac-gtaaa-tg — tat — taagga-g — ta 

0 1 5 - GTACAACTACTTTCATGTCCAACATTCTGTCTACCCTTAACATGAACTATAAT ATGACTAAG -0 1 5 
024 - TGTAAAGGTTAGTCATAC ATTGTTCATTTGTAAAA - CTGCACATGGGTGTGTG-02 4 
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6 7857 TGCgACCGGTTTCGGTTAtCCACACCCTACATATTTCCTTCTTATA 

< HI iiiliniiiitii lliiiiiiiiiiiiiiiiiiiiniii 

11 7886 TGCAACCGGTTTCGGTTAcCCACACCCTACATATTTCCTTCTTATA 

.„ M Ml III I III II II (III 

33 7863 CtAACCG TTTTaGGTcAtaTTggtCATTTA tAaTCtTTTATATAATA 

I Hill llll II 1 1 1 1 1 1 i minim 

16 7854 CaAACCGATTTT gGGTTACACATTTACAAGCAAcTTATATAATAATACTaa 

I III I III III III I II II I II I HIM 

31 7860 agGTattAcaccgtTTTcGGTTACAGtTTTACAAGCAAtTGtTCTTtTTATACT 

ii in i ii i ii til Mini 

18 7800 ctGTgcatacatagTTratGcaACcGaaaTAggttgggcaGcaCaTacTATACTtttc 

con cg-aac ttt-ggttatg — acccat-tA-a-ttc-tt-ttataataatact 

015-CTGTGCATACATAGTTTATGCAACCGAAATAGGTTGGGCAGCACATACTATACTTTTC- (015 ) 
024 -CAAACCGATTTT GGGTTACACATTTACAAGCAACTTATATAATAATACTAA ( -02 4 ) 



Claims 



Claims (or the tollowing Contracting States : AT, BE, CH, LI, DE, DK, FR, GB, GR, IT, NL, SE 

1. A composition uselul in LCR lor amplifying the DNA of human papilloma virus present in a test sample, said 
composition comprising a set of four oligonucleotide probes, said probe sets being selected from the group con- 
sisting of the following oligonucleotide sets: 

LCR5: SEQ ID No. 

81 GCTGCAAACA ACTATACAT6 ATATAA 

82 TTATATCATG TATAGTTGTT TGCAGC 

83 TATTAGAATG TGTGTACTGC AAGCA 

84 TGCTTGCAGT ACACACATTC TAATA ; 



LCR6: SEQ 10 No. 

85 CTTCACTGCA AGACATAGAA ATAA, 

86 TTATTTCTAT GTCTTGCAGT GAA 

87 CCTGTGTATA TTGCAAGACA GTAT 

88 TACTGTCTTG CAATATACAC AGO;' 



LCR7: SEQ ID No. 

89 TATATTGCAA 

90 GTTCCAATAC 

91 TTACAGAGGT 

92 AATGCAAATT 



GACAGTATTG GAAC, 
TGTCTTGCAA TTTA, 
ATTTGAATTT GCATT, 
CAAATACCTC TGTAA ; and 



LCR8: SEQ ID No. 

93 GTATGGAACA ACATTAGAAC AGCA 

94 TGCTGTTCTA ATGTTGTTCC ATAc' 

95 ATACAACAAA CCGTTGTGTG ATTT* 

96 AAATCACACA ACGGTTTGTT GTAT ' 
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A composition according to claim 1 for amplifying the DNA of human papilloma virus type 1 6 present in a test 
sample, said composition comprising a set of four oligonucleotide probes, said probe sets being selected from the 
group consisting of the following oligonucleotide sets: 

LCR5 (SEQ ID Nos. 81 ,82,83 and 84) and LCR8 (SEQ ID Nos. 93, 94, 95 and 96). 

A composition according to claim 1 for amplifying the DNA of human papilloma virus type 18 present in a test 
sample, said composition comprising a set of four oligonucleotide probes, said probe sets being selected from the 
group consisting of the following oligonucleotide sets: 

LCR 6(SEQ ID Nos. 85,86,87 and 88) and LCR 7 (SEQ ID Nos. 89,90,91 and 92). 

A kit for detecting the presence of human papilloma virus DNA in a test sample, comprising: 
a composition according to any of claims 1 to 3; and 
further comprising a ligase. 

A kit according to claim 4, wherein said ligase is thermostable. 

A composition useful in PCR for amplifying the DNA of human papilloma virus present in a test sample, said 
composition comprising: 

a first nucleic acid primer of sense direction, capable of hybridizing to the antisense strand of HPV DNA , said 
primer having from 10 to about 30 nucleotides in length and having a sequence selected from the group 
consisting of the following sequences: 

Sffl ID No, - 

) CA6ATGTCTC TGTGGCGGCC TAGTG. 



6 


GAATTAGTTA 


GACCATTTAA 


AAG. 


7 


GGGGAAACAC 


CAGAATG6AT 


A, 


81 


GCTGCAAACA 


ACTATACATG 


ATATAA. 


85 


CTTCACTGCA 


AGACATAGAA 


ATAA. 


89 


TATATTGCAA 


GACAGTATTG 


GAAC and 


93 


GTATGGAACA 


ACATTAGAAC 


AGCA; and 



a second nucleic acid primer of antisense direction, capable of hybridizing to the sense strand of HPV DNA, 
said primer having from 10 to about 30 nucleotides in length and-having a sequence selected from the group 
consisting of the following sequences: 

SEQ IP nq, 



5 


AGGTGTCAG6 


AAAACCAAAT 


TTATT. 


84 


TGCTTGCAGT 


ACACACATTC 


TAATA, 


88 


TACTGTCTTG 


CAATATACAC 


AGG, 


92 


AATGCAAATT 


CAAATACCTC 


TGTAA and 


96 


AAATCACACA 


ACGGTTTGTT 


GTAT; 



provided said first and second primers hybridize to their respective antisense and sense strands at locations 
such that their 3' ends do not overlap and, in the direction of extension, the 5' ends of said primers are spaced 
further apart than the 3* ends of said primers. 

A composition according to claim 6 wherein said first and second primers are selected from the following pairs of 
oligonucleotide sequences ( identified by Sequence ID No.): 
1 and 5, 6 and 5, 7 and 5, 81 and 84, 
85 and 88, 89 and 92, and 93 and 96. 

A kit for detecting the presence of human papilloma virus DNA in a test sample, comprising: 
a composition according to claim 6 or 7 ; and 
further comprising a polymerase. 
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9. A kit according to claim 8 wherein said polymerase is thermostable. 

10. A consensus oligonucleotide for hybridizing human papilloma virus types 6, 11, 16, 18, 31, 33 and 61, which 
oligonucleotide comprises from about 10 to about 60 nucleotides in length and is selected from the group of se- 
quences consisting of: 

SFQ ID No. 

! CA6ATGTCTC T6T6GCGGCC TAGTG. 

5 AGGTGTCAGG AAAACCAAAT TTATT, 

6 GAATTAGTTA GACCATTTAA AAG and 

7 GGGGAAACAC CAGAATGGAT A; 

and their complements. 

11. A type-specific oligonucleotide for determining the presence of human papilloma virus type 16, having a sequence 
selected from the group consisting of: 



5FQ 'D N 0l 








81 


GCT6CAAACA 


ACTATACATG 


ATATAA, 


82 


TTATATCATG 


TATAGTTGTT 


TGCAGC, 


83 


TATTAGAATG 


TGTGTACTGC 


AAGCA , 


84 


TGCTTGCAGT 


ACACACATTC 


TAATA. 


93 


GTATGGAACA 


ACATTAGAAC 


AGCA , 


94 


TGCTGTTCTA 


ATGTTGTTCC 


ATAC. 


95 


ATACAACAAA 


CC6TTGTGTG 


ATTT and 


96 


AAATCACACA 


ACGGTTTGTT 


GTAT; 



and their complements. 

12. A type-specific oligonucleotide for determining the presence of human papilloma virus type 1 8, having a sequence 
selected from the group consisting of: SEQ ID No. 

SEP Nn 



85 


CTTCACTGCA 


AGACATA6AA 


ATAA, 


86 


TTATTTCTAT 


GTCTTGCAGT 


GAA, 


87 


CCTGTGTATA 


TTGCAAGACA 


GTAT. 


88 


TACTGTCTTG 


CAATATACAC 


AGG, 


89 


TATATTGCAA 


GACAGTATT.6 


GAAC. 


90 


GTTCCAATAC 


TGTCTTGCAA 


TTTA, 


91 


TTACAGAGGT 


ATTTGAATTT 


GCATT and 


92 


AATGCAAATT 


CAAATACCTC 


TGTAA; 



and their complements. 

13. A method for determining the presence of any human papilloma virus in a test sample, comprising: 

a. hybridizing DNA in the test sample with at least one consensus oligonucleotide selected from the group of 
claim 10, said oligonucleotide being conjugated to a signal generating compound capable of producing a 
detectable signal; and 

b. determining the presence of human papilloma virus by detecting the signal generated. 

14. A method for determining the presence of human papilloma virus type 16 in a test sample, comprising: 

a. hybridizing DNA in the test sample with at least one oligonucleotide selected from the group of claim 11, 
said oligonucleotide being conjugated to a signal generating compound capable of producing a detectable 
signal; and 

b. determining the presence of human papilloma virus by detecting the signal generated. 
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15. A method for determining the presence of human papilloma virus type 18 in a test sample, comprising: 

a. hybridizing DNA in the test sample with at least one oligonucleotide selected from the group of claim 12, 
said oligonucleotide being conjugated to a signal generating compound capable of producing a detectable 
signal; and 

b. determining the presence of human papilloma virus by detecting the signal generated. 

16. A method according to any of claims 1 3-15, further comprising a step of amplification prior to or concurrent with 
said hybridizing step. 

17. A method according to claim 16, wherein said amplification step comprises PCR or LCR. 



Claims for the following Contracting States : ES 

1. A composition useful in LCR for amplifying the DNA of human papilloma virus present in a test sample, said 
composition comprising a set of four oligonucleotide probes, said probe sets being selected from the group con- 
sisting of the following oligonucleotide sets: 



ICRS: SEQ ID No. 

81 GCTGCAAACA 

82 TTATATCATG 

83 TATTAGAATG 

84 TGCTTGCAGT 



ACTATACATG ATATAA, 
TAT AGTTGTT TGCAGC , 
TGTGTACTGC AAGCA , 
ACACACATTC TAATA; 



LCR6: SEQ ID No. 

85 CTTCACTGCA 

86 TTATTTCTAT 

87 CCTGTGTATA 

88 TACTGTCTTG 



AGACATAGAA ATAA , 
GTCTTGCAGT GAA, 
TTGCAAGACA GTAT , 
CAATATACAC AGG; 



ICR7: 



SEQ ID No. 
89 
90 
9i 
92 



TATATTGCAA 
GTTCCAATAC 
TTACAGAGGT 
AATGCAAATT 



GACAGT ATTG 
TGTCTTGCAA 
ATTTGAATTT 
CAAATACCTC 



GAAC , 
TTT A , 
GCATT , 
TGTAA; and 



LCR8: SEQ ID No. 

93 GTATGGAACA ACATTAGAAC AGCA , 

94 TGCTGTTCTA ATGTTGTTCC ATAC. 

95 ATACAACAAA CCGTTGTGTG ATTT, 

96 AAATCACACA ACGGTTTGTT GTAT. 



2. A composition according to claim 1 for amplifying the DNA of human papilloma virus type 16 present in a test 
sample, said composition comprising a set of four oligonucleotide probes, said probe sets being selected from the 
group consisting of the fol lowing oligonucleotide sets : 

LCR5 (SEQ ID Nos. 81,82,83 and 84) and LCR8 (SEQ ID Nos. 93, 94, 95 and 96). 

3. A composition according to claim 1 for amplifying the DNA of human papilloma virus type 18 present in a test 
sample, said composition comprising a set of four oligonucleotide probes, said probe sets being selected from the 
group consisting of the following oligonucleotide sets: 

LCR6(SEQ ID Nos. 85,86.87 and 88) and LCR 7(SEQ ID Nos. 89,90,91 and 92). 

4. A kit for detecting the presence of human papilloma virus DNA in a test sample, comprising: 
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10 



15 



20 



25 



a composition according to any of claims 1 to 3; and further comprising a ligase. 

5. A kit according to claim 4, wherein said ligase is thermostable. 

6. A composition useful in PGR for amplifying the DNA of human papilloma virus present in a test sample, said 
composition comprising: 

a first nucleic acid primer of sense direction, capable of hybridizing to the antisense strand of HPV DNA, said 
primer having from 10 to about 30 nucleotides in length and having a sequence selected from the group 
consisting of the following sequences: 

SEP ID No . 



I 


CAGATOTCTC 


TGTGGCGGCC 


TAGTG. 


6 


GAATTAGTTA 


GACCATTTAA 


AAG , 


7 


GGGGAAACAC 


CAGAATGGAT 


A, 


8! 


GCTGCAAACA 


ACT ATACATG 


ATATAA , 


85 


CTTCACTGCA 


AGACATAGAA 


ATAA. 


89 


TATATTGCAA 


GACAGTATTG 


GAAC and 


93 


GTATGGAACA 


ACATTAGAAC 


AGCA; and 



a second nucleic acid primer of antisense direction, capable of hybridizing to the sense strand of HPV DNA, 
said primer having from 10 to about 30 nucleotides in length and having a sequence selected from the group 
consisting of the following sequences: 

SEP ID No. 



30 



35 



40 



5 


AGGTGTCAGG 


AAAACCAAAT 


TTATT. 


84 


TGCTTGCAGT 


ACACACATTC 


TAATA, 


88 


TACTGTCTTG 


CAATATACAC 


AGG , 


92 


AATGCAAATT 


CAAATACCTC 


TGTAA and 


96 


AAATCACACA 


ACGGTTTGTT 


GTAT; 



provided said first and second primers hybridize to their respective antisense and sense strands at locations 
such that their 3' ends do not overlap and, in the direction of extension, the 5* ends of said primers are spaced 
further apart than the 3* ends of said primers. 

7. A composition according to claim 6 wherein said first and second primers are selected from the following pairs of 
oligonucleotide sequences (identified by Sequence ID No.): 

1 and 5, 6 and 5, 7 and 5, 81 and 84, 
85 and 88, 89 and 92, and 93 and 96. 

8. A kit for detecting the presence of human papilloma virus DNA in a test sample, comprising: 
45 a composition according to claim 6 or 7; and 

further comprising a polymerase. 

9. A kit according to claim 8 wherein said polymerase is thermostable. 

50 10. A method for determining the presence of any human papilloma virus in a test sample, comprising: 

a. hybridizing DNA in the test sample with at least one consensus oligonucleotide selected from the group of 
sequences consisting of: 

55 
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SFQ LB "0. 

1 CAGAT6TCTC T6TGGC66CC TAGTG, 

5 AG6TGTCAGG AAAACCAAAT TTATT , 

6 GAATTAGTTA GACCATTTAA AAG and 

7 GGGGAAACAC CAGAATGGAT A; 

and their complements, 

said oligonucleotide being conjugated to a signal generating compound capable of producing a detectable 
signal; and 

b. determining the presence of human papilloma virus by detecting the signal generated. 

11, .A method for determining the presence of human papilloma virus type 16 in a test sample, comprising: 

a. hybridizing DNA in the test sample with at least one oligonucleotide selected from the group of sequences 
consisting of: 

SEO LB wo. 



81 
82 
83 
84 
93 
94 
95 
96 



6CTGCAAACA 
TTATATCATG 
TATTAGAATG 
TGCTTGCAGT 
GTATGGAACA 
TGCTGTTCTA 
ATACAACAAA 
AAATCACACA 



ACTATACATG 
TATAGTTGTT 
TGTGTACT6C 
ACACACATTC 
ACATTAGAAC 
ATGTTGTTCC 
CC6TTGTGTG 
ACGGTT7GTT 



ATATAA, 
TGCAGC, 
AAGCA. 
TAATA. 
AGCA , 
ATAC, 
ATTT and 
GTAT; 



and their complements, said oligonucleotide being conjugated to a signal generating compound capable of 
producing a detectable signal; and 

b. determining the presence of human papilloma virus by detecting the signal generated. 



12. . A method for determining the presence of human papilloma virus type 18 in a test sample, comprising: 



a. hybridizing DNA in the test sample with at least one oligonucleotide selected from the group of sequences 
consisting of: 



85 


CTTCACTGCA 


86 


TTATTTCTAT 


67 


CCTGTGTATA 


88 


TACTGTCTTG 


89 


TATATTGCAA 


90 


GTTCCAATAC 


91 


TTACAGAGGT 


92 


AATGCAAATT 



AGACATAGAA ATAA , 
GTCTTGCAGT GAA , 
TTGCAAGACA GTAT, 
CAATATACAC AGG , 
GACAGTATT.G GAAC. 
TGTCTTGCAA TTT A, 
ATTTGAATTT GCATT and 
CAAAT ACCTC .TGTAA; . 



and their complements, 

said oligonucleotide being conjugated to a signal generating compound capable of producing a detectable 
signal; and 

b. determining the presence of human papilloma virus by detecting the signal generated. 

13. A method according to any of claims 10-12 , further comprising a step of amplification prior to or concurrent with 
said hybridizing step. 

14. A method according to claim 1 3, wherein said amplification step comprises PCR or LCR. 
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Patentanspruche 



Patentanspruche fur folgende Vertragsstaaten : AT, BE, CH, LI, DE, DK, FR, GB, GR, IT, NL, SE 

1. Zusammensetzung, die fur die LCR ('ligase chain reaction", Ligasekettenreaktion) zur Vervielfachung der DNA 
des humanen Papillomavirus nutzlich ist, der in einer Testprobe vorhanden ist, wobei die Zusammensetzung einen 
Satz von vier Oligonukleotidsonden umfaftt, wobei die Sondensatze aus der Gruppe gewahlt sind, die aus den 
folgenden Oligonukleotidsatzen besteht: 



LCRS: SEQ ID N r 

8 1 GCTGCAAACA 

32 TTaTATCaTQ 

33 7a77aGaa7G 

34 TGCTTGCAGT 



ACTATACATG ATATAA. 
TATAGTTGiT TGCAGC, 
7G7G7AC7GC AAGCA , 
ACACACATTC TAATA; 



5£QI0Nr 

85 CT7CACTGCA AGACATAGAA ATAA, 

86 TTATTTCTAT GTCTTGCAGT GAA , 
37 CCTGTGTATA TT6CAA6ACA GTAT , 
88 TACTGTCTTG CAATA7ACAC AGG ; 



LCR7: ScG lONr 

89 TAlATTGCAA 

90 GTTCCAATAC 

9 1 TTACAGAGGT 

92 AATGCAAATT 



GACAG7A77G GAAC , 
TGTCTTGCAA TTTA , 
ATTTGAATTT GCA77, 
CAAATACCTC TGTAA; und 



LCR 3: SEQ ID Nr 

93 G7ATGGAACA 

94 TGCTGTTCTA 

95 ATACAACAAA 

96 AAATCACACA 



acattagaac agca, 
a7g77g77cc a7ac , 
ccgt7gtgtg a777, 

ACGG777G77 G7A7. 



2. Zusammensetzung nach Anspruch 1 zur Vervielfachung der DNA des humanen Papillomavirus Typ 1 6, der in 
einer Testprobe vorhanden ist, wobei die Zusammensetzung einen Satz von vier Oligonukleotidsonden umfaftt, 
wobei die Sondensatze aus der Gruppe gewahlt sind, die aus den folgenden Oligonukleotidsatzen besteht: 
LCRS (SEQ ID Nrn 81, 82, 83 und 84) und LCR8 (SEQ ID Nm 93, 94, 95 und 96) 

3. Zusammensetzung nach Anspruch 1 zur Vervielfachung der DNA des humanen Papillomavirus Typ 1 8, der in 
einer Testprobe vorhanden ist, wobei die Zusammensetzung einen Satz von vier Oligonukleotidsonden umfa&t, 
wobei die Sondensatze aus der Gruppe gewahlt sind, die aus den folgenden Oligonukleotidsatzen besteht: 
LCR6 (SEQ ID Nrn 85, 86, 87 und 88) und LCR7 (SEQ ID Nm 89, 90, 91 und 92). 

4. Kit zum Nachwets der Anwesenheit der DNA des humanen Papillomavirus in einer Testprobe, das folgendes urn- 
faf3t: 

eine Zusammensetzung nach einem der Anspruche 1 bis 3, und des weiteren eine Ligase. 

5. Kit nach Anspruch 4, worin die Ligase thermostabil ist. 

6. Zusammensetzung, die bei der PCR ("polymerase chain reaction' Polymerasekettenreaktion) zur Vervielfachung 
der DNA des humanen Papillomavirus nutzlich ist, der in einer Testprobe vorhanden ist, wobei die Zusammenset- 
zung folgendes umfaGt: 
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einen ersten Nukleinsaureprimer, der zur Richtung gleichlaufig ist, welcher zur Hybridisierung an den gegen- 
laufigen Strang der HPV-DNA befahigt ist, wobei der Primer 10 bis ungefahr 30 Nukleotide lang ist und eine 
Sequenz autweist, die aus der Gruppe gewahlt ist, die aus den folgenden Sequenzen besteht: 



I CAGATGTCTC T6T6GCGGCC TAGTG, 



6 


GAATTAGTTA 


GACCATTTAA 


AAG. 




7 


GGGGAAACAC 


CAGAATGGAT 


A. 




81 


GCTGCAAACA 


ACTATACATG 


ATATAA. 


85 


CTTCACTGCA 


AGACATAGAA 


ATAA, 




89 


TATATTGCAA 


GACAGTATTG 


GAAC 


und 


93 


GTA7GGAACA 


ACATTAGAAC 


AGCA; 


und 



einen zweiten Nukleinsaureprimer, der zur Richtung gegenlaufig ist, welcher zur Hybridisierung an den gleich- 
laufigen Strang der HPV-DNA befahigt ist, wobei der Primer 10 bis ungefahr 30 Nukleotide lang ist und eine 
Sequenz aufweist, die aus der Gruppe gewahlt ist, die aus den folgenden Sequenzen besteht: 

SEQ IP Nr 



5 


AGGTGTCAGG 


AAAACCAAAT 


84 


TGCTTGCAGT 


ACACACATTC 


88 


TACTGTCTTG 


CAATATACAC 


92 


AATGCAAATT 


CAAATACCTC 


96 


AAATCACACA 


ACGGTTTGTT 



vorausgesetzt, daB der erste und der zweite Primer an ihre jeweiligen gleich- und gegenlaufigen Strange an 
solchen Stellen hybridisieren, daG ihre 3'-Enden nicht uberlappen, und daG die 5'-Enden der Primer in Ver- 
langerungsrichtung weiter raumlich abgesetzt sind als die 3'-Enden der Primer. 



7. Zusammensetzung nach Anspruch 6, worin der erste und zweite Primer aus den folgenden Paaren von Oligonu- 
kleotidsequenzen (die durch die Sequenz ID Nr bezeichnet sind) gewahlt sind: 

1 und 5, 6 und 5, 7 und 5, 81 und 84, 
85 und 88, 89 und 92, und 93 und 96. 

8. Kit zum Nachweis der Anwesenheit der DNA des humanen Papillomavirus in einer Testprobe, das folgendes um- 
faBt: 

eine Zusammensetzung nach Anspruch 6 Oder ,7 und des weiteren eine Polymerase. 



9. Kit nach Anspruch 8, worin die Polymerase thermostabil ist. 



10. Consensus-Oligonukleotid zur Hybridisierung der humanen papillomaviren Typ 6, 11, 16, 18, 31, 33 und 61, wobei 
das Oligonukleotid ungefahr 10 bis ungefahr 60 Oligonukleotide lang ist und aus der Gruppe von Sequenzen 
gewahlt ist, die aus folgendem besteht: 

SfiQ IP Nr 



I CAGATGTCTC 

5 AGGTGTCAGG 

6 GAATTAGTTA 

7 GGGGAAACAC 



TGTGGCGGCC TAGTG, 
AAAACCAAAT TTATT, 
GACCATTTAA AAG 
CAGAATGGAT A; un< 
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und aus deren Komplementen. 



11 . Typ-spezifisches Oligonukleotid zur Bestimmung der Anwesenheit des humanen Papillomavirus Typ 1 6, das eine 
Sequenz aufweist. die aus der Gruppe gewahlt ist, die aus (olgendem besteht: 

SEP ID Mr 



8 1 GCT6CAAACA 

82 TTATATCATG 

83 TATTASAATG 

84 TGCTTGCAGT 

93 GTATGGAACA 

94 TGCTGTTCTA 

95 ATACAACAAA 

96 AAATCACACA 

und aus deren Komplementen. 



ACTATACATG ATATAA , 
TATAGTTGTT TGCAGC 
TGTGTACTGC AAGCA. ' 
ACACACATTC TAATA. 
ACATTAGAAC AGCA , 
ATGTTGTTCC ATAC, 
CCGTTGTGTG ATTT nn 
ACGGTTTGTT GTAT ; 



12. Typ-spezifisches Oligonukleotid zur Bestimmung der Anwesenheit des humanen Papillomavirus Typ 18, das eine 
Sequenz aufweist, die aus der Gruppe gewahlt ist, die aus folgendem besteht: 



fiflO IP Nr 



85 
86 
87 
88 
89 
90 
91 
92 

und aus deren Komplementen. 



CTTCACTGCA 
TTATTTCTAT 
CCTGTGTATA 
TACTGTCTTG 
TATATTGCAA 
GTTCCAATAC 
TTACAGAGGT 
AATGCAAATT 



AGACATAGAA ATAA. 
GTCTTGCAGT GAA , 
TTGCAAGACA GTAT. 
CAATATACAC AGG , 
GACAGTATT.G GAAC. 
TGTCTTGCAA TTTA, 
ATTTGAATTT GCATT 
CAAATACCTC .TGTAA; 



und 



13. Verfahren zur Bestimmung der Anwesenheit irgendeines humanen Papillomavirus in einer Testprobe, das folgen- 
des umfaftt: 



a. Hybridisieren der DNA in der Testprobe mit wenigstens einem Consensus-Oligonukleotid, das aus der Grup- 
pe nach Anspruch 10 gewahlt ist, wobei das Oligonukleotid an eine signalerzeugende Verbindung konjugiert 
ist, die zur Erzeugung eines nachweisbaren Signals befahigt ist, und 

b. Bestimmen der Anwesenheit des humanen Papillomavirus, indem das erzeugte Signal nachgewiesen wird. 

14. Verfahren zur Bestimmung der Anwesenheit des humanen Papillomavirus Typ 16 in einer Probe, das folgendes 
umfaBt: 



a. Hybridisieren der DNA in der Testprobe mit wenigstens einem Oligonukleotid, das aus der Gruppe nach 
Anspruch 11 gewahlt ist, wobei das Oligonukleotid an eine signalerzeugende Verbindung konjugiert ist, die 
zur Erzeugung eines nachweisbaren Signals befahigt ist, und 

b. Bestimmen der Anwesenheit des humanen Papillomavirus, indem das erzeugte Signal nachgewiesen wird. 

15. Verfahren zur Bestimmung der Anwesenheit des humanen Papillomavirus Typ 18 in einer Testprobe, das folgendes 
umfaBt: 



a. Hybridisieren der DNA in der Testprobe mit wenigstens einem Oligonukleotid, das aus der Gruppe nach 
Anspruch 12 gewahlt ist, wobei das Oligonukleotid an eine signalerzeugende Verbindung konjugiert ist, die 
zur Erzeugung eines nachweisbaren Signals befahigt ist, und 
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b. Bestimmen der Anwesenheit des humanen Papillomavirus, indem das erzeugte Signal nachgewiesen wird. 

16. Vertahren nach einem der Anspruche 13-15, das des weiteren einen Vervielfachungsschritt umfaGt, der vor Oder 
in Konkurrenz mit dem Hybridisierungsschritt stattfindet. 

17. Vertahren nach Anspruch 16, worin der Vervielfachungsschritt PCR Oder LCR umfaGt. 
Patentanspruchefur folgenden Vertragsstaat : ES 

1. Zusammensetzung, die fur die LCR ("ligase chain reaction", Ligasekettenreaktion) zur Vervielfachung der DNA 
des humanen Papillomavirus nutzlich ist, der in einer Testprobe vorhanden ist, wobei die Zusammensetzung einen 
Satz von vier Oligonukleotidsonden umfaGt, wobei die Sondensatze aus der Gruppe gewahlt sind, die aus den 
folgenden Oligonukleotidsatzen besteht: 



ICRS: 



5EQI0N^ 
8! 
82 
83 
84 



GCTGCAAACA 
TTATATCATG 
TATTAGAATG 
TGCTTGCAGT 



ACTATACATG 
TaTAGTTGTT 
TGTGTACTGC 
ACACACATTC 



A7A7AA, 

tgcagc 

AAGCA, 
TAATA ; 



ICR6: StGIDNr 

85 CTTCACTGCA AGACATAGAA ATAA , 

86 TTATTTCTAT GTCTTGCAGT GAA , 

87 CCTG7GTATA TTGCAAGACA GTAT , 

88 TACTGTCTTG CAATATACAC AGG ; 



LCR7: ScQ lONr 

89 TA7A77GCAA 

90 G77CCAA7AC 

91 T7ACAGAGG7 

92 AA7GCAAA77 



GACAG7A77G GAAC , 
7G7C77GGAA 777a, 
A777GAA777 GCA77 , 
CAAA7ACCTC 7G7AA; 



LCR 3: ScQ 10 Nr 

93 G7A7GGAACA 

94 7GC7G77C7A 

95 A7ACAACAAA 

96 AAA7CACACA 



ACA77AGAAC AGCA, 

A7G77G77CC A7AC , 

CCG77G7G7G A7T7, 

ACGG777G77 G7A7. 



2. Zusammensetzung nach Anspruch 1 zur Vervielfachung der DNA des humanen Papillomavirus Typ 16, der in 
einer Testprobe vorhanden ist, wobei die Zusammensetzung einen Satz von vier Oligonukleotidsonden umfaGt, 
wobei die Sondensatze aus der Gruppe gewahlt sind. die aus den folgenden Oligonukleotidsatzen besteht: 
LCR5 (SEQ ID Nrn 81 , 82, 83 und 84) und LCRS (SEQ ID Nrn 93, 94, 95 und 96). 

3. Zusammensetzung nach Anspruch 1 zur Vervielfachung der DNA des humanen Papillomavirus TYP 18, der in 
einer Testprobe vorhanden ist, wobei die Zusammensetzung einen Satz von vier Oligonukleotidsonden umfaGt, 
wobei die Sondensatze aus der Gruppe gewahlt sind, die aus den folgenden Oligonukleotidsatzen besteht: 
LCR6 (SEQ ID Nrn 85, 86, 87 und 88) und LCR7 (SEQ ID Nm 89, 90, 91 und 92). 

4. Kit zum Nachweis der Anwesenheit der DNA des humanen Papillomavirus in einer Testprobe, das folgendes um- 
faGt: 

eine Zusammensetzung nach einem der Anspruche 1 bis 3, und des weiteren eine Ligase. 

5. Kit nach Anspruch 4, worin die Ligase thermostabil ist. 
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6. Zusammensetzung, die bei der PCR ("polymerase chain reaction" polymerasekettenreaktion) zur Vervielfachung 
der DNA des humanen Papillomavirus nutzlich ist, der in einer Testprobe vorhanden ist, wobei die Zusammenset- 
zung folgendes umfaGt: 

einen ersten Nukleinsaureprimer, der zur Richtung gleichlaufig ist, welcher zur Hybridisierung an den gegen- 
laufigen Strang der HPV-DNA befahigt ist, wobei der Primer 10 bis ungefahr 30 Nukleotide lang ist und eine 
Sequenz aufweist, die aus der Gruppe gewahlt ist, die aus den folgenden Sequenzen besteht: 
SEP ID Nr 



1 


CAGATGTCTC 


TGTGGCGGCC 


TAGTG 


i 


6 


GAA7TAGTTA 


GACCATTTAA 


AAG, 




7 


GGGGAAACAC 


CAGAATGGAT 


A. 




81 


GCTGCAAACA 


ACT ATACATG 


ATATA. 


* ♦ 


85 


CTTCACTGCA 


AGACATAGAA 


ATAA. 




89 


TAT ATTGCAA 


GACAGTATTG 


GAAC 


und 


93 


GTATGGAACA 


ACATTAGAAC 


AGCA; 


und 



einen zweiten Nukleinsaureprimer, der zur Richtung gegenlaufig ist, welcher zur Hybridisierung an den gleich- 
laufigen Strang der HPV-DNA befahigt ist, wobei der Primer 10 bis ungefahr 30 Nukleotide lang ist und eine 
Sequenz aufweist, die aus der Gruppe gewahlt ist, die aus den folgenden Sequenzen besteht: 



SEO 


ID Nr 






S 


AGGTGTCAGG 


AAAACCAAAT 


TTATT, 


84 


TGCTTGCAGT 


ACACACATTC 


TAATA, 


88 


TACTGTCTTG 


CAATATACAC 


AGG, 


92 


AATGCAAATT 


CAAATACCTC 


TGTAA 


96 


AAATCACACA 


ACGGTTTGTT 


GTAT; 



und 



vorausgesetzt, daG der erste und der zweite Primer an ihre jeweiligen gleich- und gegenlaufigen Strange an 
solchen Stellen hybridisieren, daG ihre 3'-Enden nicht uberlappen, und daG die 5'«Enden der Primer in Ver- 
langerungsrichtung weiter raumlich abgesetzt sind als die 3'-Enden der Primer. 

7. Zusammensetzung nach Anspruch 6, worin der erste und zweite Primer aus den folgenden Paaren von Oligonu- 
kleotidsequenzen (die durch die Sequenz ID Nr bezeichnet sind) gewahlt sind: 

1 und 5, 6 und 5, 7 und 5, 81 und 84, 
85 und 88, 89 und 92, und 93 und 96. 

8. Kit zum Nachweis der Anwesenheit der DNA des humanen Papillomavirus in einer Testprobe, das folgendes um- 
faGt: 

eine Zusammensetzung nach Anspruch 6 Oder 7, und des weiteren eine Polymerase. 

9. Kit nach Anspruch 8, worin die Polymerase thermostabil ist. 

10. Verfahren zur Bestimmung der Anwesenheit irgendeines humanen papillomavirus in einer Testprobe, das folgen- 
des umfaGt: 

a. Hybridisieren der DNA in der Testprobe mit wenigstens einem Consensus-Oligonukleotid, das aus der Grup- 
pe von Sequenzen gewahlt ist, die aus folgendem besteht: 



62 



EP 0 477 972 B1 



SEO IP Nr 



5 
6 
7 



CAGATGTCTC tgtggcggcc tagtg, 

AGGTGTCAGG AAAACCAAA 7 TTATT 

GAATTAGTTA GACCATTTAA AAG und 

GGGGAAACAC CAGAATGGAT A; 



und aus deren Kompiementen, 

wobei das Oligonukleotid an eine signaierzeugende Verbindung konjugiert ist, die zur Erzeugung eines nach- 
weisbaren Signals befahigt ist, und 

b. Bestimmen der Anwesenheit des humanen Papillomavirus, indem das erzeugte Signal nachgewiesen wird. 

verfahren zur Bestimmung der Anwesenheit des humanen Papillomavirus Typ 16 in einer Probe, das folgendes 
umfaGt: 

a. Hybridisieren der DNA in der Testprobe mit wenigstens einem Oligonukleotid, das aus der Gruppe von 
Sequenzen gewahlt ist, die aus folgendem besteht: 



SEQ IP Nr 



81 GCT6CAAACA ACTATACATG ATATAA, 

82 TTATATCATG TATAGTTGTT TGCAGC, 

83 TATTAGAATG TGTGTACTGC AAGCA , 

84 TGCTTGCAGT ACACACATTC TAATA, 

93 GTATGGAACA ACATTAGAAC AGCA, 

94 TGCTGTTCTA ATGTTGTTCC ATAC. 

95 ATACAACAAA CCGTTGTGTG ATTT un d 

96 AAATCACACA ACGGTTTGTT GTAT ; 



und aus deren Kompiementen, 

wobei das Oligonukleotid an eine signaierzeugende Verbindung konjugiert ist, die zur Erzeugung eines nach- 
weisbaren Signals befahigt ist, und 

b. Bestimmen der Anwesenheit des humanen Papillomavirus, indem das erzeugte Signal nachgewiesen wird. 

Verfahren zur Bestimmung der Anwesenheit des humanen Papillomavirus Typ 1 8 in einer Testprobe, das folgendes 
umfaGt: 

a. Hybridisieren der DNA in der Testprobe mit wenigstens einem Oligonukleotid, das aus der Gruppe von 
Sequenzen gewahlt ist, die aus folgendem besteht: 



m 
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SEQ ID 

85 CTTCACT6CA AGACATAGAA ATAA. 

86 TTATTTCTAT GTCTTGCAGT GAA, 

87 CCTGTGTATA TTGCAAGACA GTAT. 

88 TACTGTCTTG CAATATACAC AGG, 

89 TATATTGCAA GACAGTATT.G GAAC. 

90 GTTCCAATAC TGTCTTGCAA TTTA, 

91 TTACAGAGGT ATTTGAATTT GCATT und 

92 AATGCAAATT CAAATACCTC .TGTAA ; 



und aus deren Komplementen, 

wobei das Oligonukleolid an eine signalerzeugende Verbindung konjugiert ist, die zur Erzeugung eines nach- 
weisbaren Signals befahigt ist, und 

b. Bestimmen der Anwesenheit des humanen Papillomavirus, indem das erzeugte Signal nachgewiesen wird. 

13. Verfahren nach einem der Anspruche 10-12, das des weiteren einen Vervielfachungsschritt umfaBt, der vor oder 
in Konkurrenz mit dem Hybridisierungsschritt stattfindet. 

14. Verfahren nach Anspruch 13, worin der vervielfachungsschritt PCR oder LCR umfaflt. 
Revendications 



Revendications pour les Etats contractants suivants : AT, BE, CH, LI, DE, DK, FR, GB, GR, IT, NL, SE 

1. Composition utile dans la LCR pouramplifier I'ADN de virus du papillome humain present dans echantillon k doser, 
ladite composition comprenant un ensemble de quatre sondes oligonucleotidiques, lesdits ensembles de sondes 
etant selectionn6s dans le groupe constitue par les ensembles d'oligonucleotides suivants : 



LCR5 : n' ^identification 



81 
82 
83 



84 



GCTGCAAACA ACTATACATG ATATAA, 
TTATATCATG TATAGTTGTT TGCAGC, 
TATTAGAATG TGTGTACTGC AAGCA, 
TGCTTGCAGT ACACACATTC TAATA; 



LCR6 : n* ^identification 



85 
86 
87 
88 



CTTCACTGCA AGACATAGAA ATAA, 
TTATTTCTAT GTCTTGCAGT GAA, 
CCTGTGTATA TTGCAAGACA GTAT, 
TACTGTCTTG CAATATACAC AGG ; 
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LCR7 : n # ^identification 

89 TATATTGCAA GACAGTATTG GAAC 

90 GTTCCAATAC TGTCTTGCAA TTTA, 

91 TTACAGAGGT ATTTGAATTT GCATT, 

92 AATGCAAATT CAAATACCTC TGTAA ; et 

LCR8 : n* ^identification 

93 GTATGGAACA ACATTAGAAC AGCA, 

94 TGCTGTTCTA ATGTTGTTCC ATAC, 

95 ATACAACAAA CCGTTGTGTG ATTT, 

96 AAATCACACA ACGGTTTGTT GTAT. 



Composition selon la revendication 1 , destinee a amplifier I'ADN de virus du papillome humain de type 16 present 
dans un echantillon a doser, ladite composition comprenant un ensemble de quatre sondes oligonucleotidiques, 
lesdits ensembles de sondes etant selectionnes dans le groupe constitue par les ensembles d'oligonucleotides 
suivants : 

LCR5 (n° d'identification 81 , 82, 83 et 84) et LCR8 (n° d'identification 93, 94, 95 et 96). 

Composition selon la revendication 1 , destinee a amplifier I'ADN de virus du papillome humain de type 18 present 
dans un 6chantillon a doser, ladite composition comprenant un ensemble de quatre sondes oligonucleotidiques, 
lesdits ensembles de sondes 6tant s6lectionn6s dans le groupe constitue par les ensembles d'oligonucleotides 
suivants : 

LCR6 (n° d'identification 85, 86, 87 et 88) et LCR7 (n° d'identification 89, 90, 91 et 92). 

Kit de detection de la presence d'ADN de virus du papillome humain dans un echantillon a doser, comprenant : 
une composition selon Tune quelconque des revendications 1 a 3, et en outre une ligase. 

Kit selon la revendication 4, dans iequel ladite ligase est thermostable. 

Composition utile dans la PCR pour amplifier I'ADN de virus du papillome humain present dans un echantillon a 
doser, ladite composition comprenant : 

une premiere amorce d'acide nucleique de direction sens, capable de s'hybrider au brin antisens de I'ADN de 
HP V, ladite amorce ayant de 1 0 a environ 30 nucleotides de long et une sequence selectionn6e dans le groupe 
constitud par les sequences suivantes : 

N* ^identification 



1 


CAGATGTCTC 


TGTGGCGGCC 


TAGTG, 


6 


GAATTAGTTA 


GACCATTTAA 


AAG, 


7 


GGGGAAACAC 


CAGAATGGAT 


A, 


81 


GCTGCAAACA 


ACTATACATG 


ATATAA, 


85 


CTTCACTGCA 


AGACATAGAA 


ATAA, 


89 


TATATTGCAA 


GACAGTATTG 


GAACet 


93 


GTATGGAACA 


ACATTAGAAC 


AGCA ; et 



une deuxieme amorce d'acide nucleique de direction antisens, capable de s'hybrider au brin sens de I'ADN 
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de HPV, ladite amorce ayant de 10 a environ 30 nucleotides de long et une sequence selectionnee dans le 
groupe constitue par les sequences suivantes : 

N # ^identification 

5 AGGTGTCAGG AAAACCAAAT TTATT, 

84 TGCTTGCAGT ACACACATTC TAATA, 

88 TACTGTCTTG CAATATACAC AGG, 

92 AATGCAAATT CAAATACXTC TGTAAet 

96 AAATCACACA ACGGTTTGTT GTAT; 

pour autant que lesdites premiere et deuxieme amorces s'hybrident a leurs brins respectifs antisens et sens 
a des emplacements tels que leurs extremites 3* ne se chevauchent pas et que, dans la direction d'extenston, 
les extremites 5* desdites amorces soient plus espacees que les extremites 3' desdites amorces. 

7. Composition selon la revendication 6, dans laquelle lesdites premiere et deuxieme amorces sont s6lectionnees 
parmi les paires suivantes de sequences oligonud'eotidiques (identifiees'par leur numero d'identification) : 

1 et 5, 6 et 5, 7 et 5, 81 et 84, 
85 et 88, 89 et 92, et 93 et 96. 

8. Kit de detection de la presence d'ADN de virus du papillome humain dans un echantillon a doser, comprenant : 

une composition selon la revendication 6 ou 7 et en outre une polymerase. 

9. Kit selon la revendication 8, dans lequel ladite polymerase est thermostable. 

10. Oligonucleotide consensus pour hybridation du virus du papillome humain des types 6, 11, 16, 18, 31, 33 et 61, 
lequel oligonucleotide a d'environ 10 a environ 60 nucleotides de long et est selectionnd dans le groupe de se- 
quences constitue par : 



N* d'identification 

1 CAGATGTCTC TGTGGCGGCC TAGTG, 

5 AGGTGTCAGG AAAACCAAAT TTATT, 

6 GAATTAGTTA GACCATTTAA AAGet 

7 GGGGAAACAC CAGAATGGAT A; 

et leurs complements. 

11. Oligonucleotide specifique d'un type, destine a determiner la presence du virus du papillome humain de type 16, 
ayant une sequence selectionnee dans le groupe constitue par : 

N # ^identification 



81 


GCTGCAAACA 


ACTATACATG 


ATATAA, 


82 


TTATATCATG 


TATAGTTGTT 


TGCAGC, 


83 


TATTAGAATG 


TGTGTACTGC 


AAGCA, 


84 


TGCTTGCAGT 


ACACACATTC 


TAATA, 


93 


GTATGGAACA 


ACATTAGAAC 


AGCA, 


94 


TGCTGTTCTA 


ATGTTGTTCC 


ATAC, 


95 


ATACAACAAA 


CCGTTGTGTG 


ATTTet 


96 


AAATCACACA 


ACGGTTTGTT 


GTAT; 



et leurs complements. 

12. Oligonucleotide sp6cifique d'un type, destine a determiner la presence du virus du papillome humain de type 18, 
ayant une sequence selectionnee dans le groupe constitue par : 
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N* ^identification 



85 
86 
87 
88 
89 
90 
91 
92 



CTTCACTCCA AGACATAGAA 
TTATTTCTAT GTCTTGCAGT 
CCTGTGTATA TTGCAAGACA 
TACTGTCTTG CAATATACAC 
TATATTGCAA GACAGTATTG 
GTTCCAATAC TGTCTTGCAA 
TTACAGAGGT ATTTGAATTT 
AATGCAAATT CAAATACCTC 



ATAA, 

GAA, 

GTAT, 

AGG, 

GAAC, 

TTTA, 



GCATTet 
TGTAA ; 



et leurs complements. 

13. Procede de determination de la presence d'un virus quelconque du papillome humain dans un echantillon £ doser, 
comprenant : 

a. I'hybridation de I'ADN dans I'echantillon a doser avec au moins un oligonucleotide consensus s6lectionn6 
dans le groupe selon la revendication 10, ledit oligonucleotide 6tant conjugue a un compose 6metteur d'un 
signal, capable de produire un signal detectable, et 

b. la determination de la presence du virus du papillome humain par detection du signal 6mis 

14. Proc6d6 de determination de la presence du virus du papillome humain de type 16 dans un echantillon a doser, 
comprenant : 

a. I'hybridation de I'ADN dans l'6chantillon a doser avec au moins un oligonucleotide selectionne dans le 
groupe selon la revendication 11, ledit oligonucleotide 6tant conjugue a un compose 6metteur d'un signal, 
capable de produire un signal detectable, et 

b. la determination de la presence du virus du papillome humain par detection du signal 6mis. 

15. Proc6d6 de determination de la presence du virus du papillome humain de type 18 dans un echantillon a doser, 
comprenant : 

a. I'hybridation de I'ADN dans I'echantillon a doser avec au moins un oligonucleotide s6lectionn6 dans le 
groupe selon la revendication 12, ledit oligonucleotide 6tant conjugue a un compose 6metteur d'un signal, 
capable de produire un signal detectable, et 

b. la determination de la presence du virus du papillome humain par detection du signal 6mis 

16. Proc6d6 selon une quelconque des revendications 13 a 15, comprenant en outre une 6tape d'amplification avant 
ou pendant ladite 6tape d'hybridation. 

17. Proc6de selon la revendication 16, dans lequel ladite 6tape d'amplification comprend la PCR ou la LCR. 



Revendications pour I'Etat contractant suivant : ES 

1. Composition utile dans la LCR pour amplifier I'ADN de virus du papillome humain present dans echantillon & doser, 
ladite composition comprenant un ensemble de quatre sondes oligonucleotidiques, lesdits ensembles de sondes 
etant seiectionnes dans le groupe constitu6 par les ensembles d'oligonucieotides suivants : 



LCR5 : n* ^identification 



81 
82 
83 
84 



GCTGCAAACA 
TTATATCATG 
TATTAGAATG 
TGCTTGCAGT 



ACTATACATG 
TATAGTTGTT 
TGTGTACTGC 
ACACACATTC 



ATATAA, 
TGCAGC, 
AAGCA, 
TAATA; 
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LCR6 : n # d'identification 

85 CTTCACTGCA 

86 TTATTTCTAT 

87 CCTGTGTATA 

88 TACTGTCTTG 



AGACATAGAA ATAA, 
GTCTTGCAGT GAA, 
TTGCAAGACA GTAT, 
CAATATACAC AGG ; 



LCR7 : n* d'identification 

89 TATATTGCAA GACAGTATTG GAAC 

90 GTTCCAATAC TGTCTTGCAA TTTA, 

91 TTACAGAGGT ATTTGAATTT GCATT, 

92 AATGCAAATT CAAATACCTC TGTAA ; et 



LCR8 : n* ^identification 

93 GTATGGAACA ACATTAGAAC AGCA, 

94 TGCTGTTCTA ATGTTGTTCC ATAC, 

95 ATACAACAAA CCGTTGTGTG ATTT, 

96 AAATCACACA ACGGTTTGTT GTAT. 



Composition selon la revendication 1 , destined a amplifier I'ADN de virus du papilloma humain de type 16 present 
dans un echantillon a doser, ladite composition comprenant un ensemble de quatre sondes oligonucleotidiques, 
lesdits ensembles de sondes etant selectionnes dans le groupe constitue par les ensembles ^oligonucleotides 
suivants : 

LCR5 (n° d'identification 81, 82, 83 et 84) et LCR8 (n° d'identification 93, 94, 95 et 96). 

Composition selon la revendication 1 , destinee a amplifier I'ADN de virus du papilloma humain de type 1 8 present 
dans un echantillon a doser, ladite composition comprenant un ensemble de quatre sondes oligonucleotidiques, 
lesdits ensembles de sondes etant selectionnes dans le groupe constitu6 par les ensembles d'oligonucl6otides 
suivants : 

LCR6 (n° d'identification 85, 86, 87 et 88) et LCR7 (n° d'identification 89, 90, 91 et 92). 

Kit de detection de la presence d'ADN de virus du papilloma humain dans un echantillon a doser, comprenant : 
une composition selon I'une quelconque des revendications 1 a 3, et en outre une ligase. 

Kit selon la revendication 4, dans lequel ladite ligase est thermostable. 

Composition utile dans la PCR pour amplifier I'ADN de virus du papilloma humain present dans un echantillon a 
doser, ladite composition comprenant : 

une premiere amorce d'acide nucleique de direction sens, capable de s'hybrider au brin antisens de I'ADN de 
HPV, ladite amorce ayant de 1 0 a environ 30 nucleotides de long et une sequence selectionnee dans le groupe 
constitud par les sequences suivantes : 
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N # d'identification 



1 


CAGATGTCTC 


TGTGGCGGCC 


TAGTG, 


6 


GAATTAGTTA 


GACCATTTAA 


AAG, 


7 


GGGGAAACAC 


CAGAATGGAT 


A, 


81 


GCTGCAAACA 


ACTATACATG 


ATATAA, 


85 


CTTCACTGCA 


AGACATAGAA 


ATAA, 


89 


TATATTGCAA 


GACAGTATTC 


GAACet 


93 


GTATGGAACA 


ACATTAGAAC 


AGCA ; et 



une deuxieme amorce d'acide nucleique de direction antisens, capable de s'hybrider au brin sens de I'ADN 
de HPV, ladite amorce ayant de 10 a environ 30 nucleotides de long et une sequence selectionnee dans le 
groupe constitue par les sequences suivantes : 

N # ^identification 

5 AGGTGTCAGG AAAACCAAAT TTATT, 

84 TGCTTGCAGT ACACACATTC TAATA, 

88 TACTGTCTTG CAATATACAC AGG, 

92 AATGCAAATT CAAATACCTC TGTAAct 

96 AAATCACACA ACGGTTTGTT GTAT; 

pour autant que lesdites premiere et deuxieme amorces s'hybrident a leurs brins respectifs antisens et sens 
a des emplacements tels que leurs extremites 3* ne se chevauchent pas et que, dans la direction d'exlension, 
les extremites 5* desdites amorces soient plus espacees que les extremites 3' desdites amorces. 

7. Composition selon la revendication 6, dans laquelle lesdites premiere et deuxieme amorces sont selectionnees 
parmi les paires suivantes de sequences oligonucleotidiques (identifies par leur numdro d'identification) : 

1 et 5, 6 et 5, 7 et 5, 81 et 84, 
B5 et 88, 89 et 92, et 93 et 96. 

8. Kit de detection de la presence d'ADN de virus du papillome humain dans un echantillon a doser, comprenant : 

une composition selon la revendication 6 ou 7 et 
en outre une polymerase. 

9. Kit selon la revendication 8, dans lequel ladite polymerase est thermostable. 

10. Procede de determination de la presence d'un virus quelconque du papillome humain dans un echantillon a doser, 
comprenant : 

a. I'hybridation de I'ADN dans I'echantillon a doser avec au moins un oligonucleotide consensus selectionne 
dans le groupe de sequences constitue par : 

N* d'identification 

1 CAGATGTCTC TGTGGCGGCC TAGTG, 

5 AGGTGTCAGG AAAACCAAAT TTATT, 

6 GAATTAGTTA GACCATTTAA AAG et 

7 GGGGAAACAC CAGAATGGAT A; 

et leurs complements, ledit oligonucleotide 6tant conjugue a un compose emetteur d'un signal, capable de 
produire un signal detectable, et 

b. la determination de la presence du virus du papillome humain par detection du signal emis. 
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11. Proc6d6 de determination de la presence du virus du papilloma humain de type 16 dans un 6chantillon a doser, 
comprenant : 

a. Thybridation de I'ADN dans l'6chantillon a doser avec au moins un oligonucleotide s6lectionn6 dans le 
groupe de sequences constitu6 par : 

N # ^identification 



81 


GCTGCAAACA 


ACTATACATG 


ATATAA, 


82 


TTATATCATG 


TATAGTTGTT 


TGCAGC, 


83 


TATTAGAATG 


TGTGTACTGC 


AAGCA, 


84 


TGCTTGCAGT 


ACACACATTC 


TAATA, 


93 


GTATGGAACA 


ACATTAGAAC 


AGCA, 


94 


TGCTGTICIA 


ATGTTGTTCC 


ATAC, 


95 


ATACAACAAA 


CCGTTGTGTG 


ATTTet 


96 


AAATCACACA 


ACGGTTTGTT 


GTAT; 



et leur& complements, 

ledit oligonucleotide 6tant conjugue a un compose 6metteur d'un signal, capable de produire un signal detec- 
table, et 

b. la determination de la presence du virus du papillome humain par detection du signal emis. 

12. Proc6d6 de determination de la presence du virus du papillome humain de type 18 dans un echantillon a doser, 
comprenant : 

a. Thybridation de I'ADN dans l'6chantillon a doser avec au moins un oligonucleotide seiectionn6 dans le 
groupe de sequences constitue par : 

N # ^identification 



85 


CTTCACTGCA 


AGACATAGAA 


ATAA, 


86 


TTATTTCTAT 


GTCTTGCAGT 


GAA, 


87 


CCTGTGTATA 


TTGCAAGACA 


GTAT, 


88 


TACTGTCTTG 


CAATATACAC 


AGG, 


89 


TATATTGCAA 


GACAGTATTG 


GAAC, 


90 


GTTCCAATAC 


TGTCTTGCAA 


TTTA, 


91 


TTACAGAGGT 


ATTTGAATTT 


GCATTct 


92 


AATGCAAATT 


CAAATACCTC 


TGTAA; 



et leurs complements, 

ledit oligonucleotide etant conjugu6 a un compose 6metteur d'un signal, capable de produire un signal detec- 
table, et 

b. la determination de la presence du virus du papillome humain par detection du signal 6mis. 

13. Proc6d6 selon une quelconque des revendications 10 a 12, comprenant en outre une etape d'amplification avant 
ou pendant ladite etape d'hybridation. 

14. Proc6de selon la revendication 13, dans lequel ladite etape d'amplification comprend la PCR ou ta LCR. . 
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